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Please amend claim 6 as follows: 

At line 1, delete "of any of claims 1-5" and insert -according to claim 

Please amend claim 7 as follows: 

At line 1, delete "accordin" and insert —according—. 
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At line 1, delete "claims 1-6" and insert -claim 1~. 

Please amend claim 9 as follows: 

At line 1, delete "any of claims 1-8" and insert —claim 1—. 

Please amend claim 10 as follows: 

At line 1, delete "any of claims 1-10" and insert -claim 1-. 

Please amend claim 11 as follows: 

At line 1, delete "accordint to claims 1-10" and insert -according to claim 

1-. 
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At line 1, delete "any of claims 1-12" and insert —claim 1-. 
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At line 1, delete "claims 1 to 15" and insert -claim 1-. 

Please amend claim 17 as follows: 

At line 1, delete "any of claims 1 to 16" and insert -claim 1~. 

Please amend claim 18 as follows: 

At line 2, delete "any one of claims 1 to 17" and insert -claim 1-. 
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Please amend claim 24 as follows: 

At line 2, delete "any one of claims 1 to 17" and insert --claim 1— 



Please amend claim 27 as follows: 

At line 2, delete "any of claims 1 to 17" and insert --claim 1-. 

Please amend claim 29 as follows: 

At lines 2 and 3, delete "any of claims 1 to 17" and insert —claim 1— . 

Please amend claim 31 as follows: 

At line 2, delete "any of claims 1 to 17" and insert —claim 1~. 
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Title: a-amylase variants 

FIELiD OF THE INVENTION 

The present invention relates to novel variants of parent 
5 Termamyl-like a-amylases V7ith altered properties relative of the 
parent alpha-amylase . Said properties include increased 
stability, e,g,, at acidic pH, e-^./ at low calcium 
concentrations and/or high temperatures. Suach variants are 
suitable for a number of applications , in particular, industrial 
10 starch processing (e.g., starch liquefaction or 

saccharif ication) . 

BACKGROUND OP THE INVENTION 

a-Amylases (a- 1 , 4 -glucan-4 -glucanohydrolasas , EC 3,2.1.1) 
^ 15 constitute a group of enzymes which catalyze hydrolysis of 
starch and other linear and branched 1 , 4 -glucosidic oligo- and 
^11 polysaccharides . 

There is a very extensive body of patent and scientific 
literature relating to this industrially very important class of 
4,20 enzymes, A number of a-amylase such as Termamyl-like a- 
amylases variants are known from, e.g., WO 90/11352/ WO 
95/10603, WO 95/26397, WO 96/23873 and WO 96/23874. 

WO 96/23 874 provides the three-dimensional. X-ray crystal 
structural data for a Termamyl-like a-amylase which consists of 
25 the 3 00 N- terminal amino acid residues of the S, 
sinyloliqu^faclens a-amylase and amino acids 301-483 of the C- 
terminal end of the 5, llch^rLlfojrmls a-amylase comprising the 
amino acid sequence (the latter being available commercially 
under the tradename Termamyl™) , and which is thus closely 
30 related to the industrially important Bacillus a-amylases (which 
in the present context are embraced within the meaning of the 
term "Termamyl-like a-amylases", and which include, inter alia., 
the B. lichenifojrmls, B, ^xnyloliquBfaciens and 
stearothermophilus a-amylases) , WO 96/23874 further describes 
35 methodology for designing, on the basis of an analysis of the 
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strucrure of a parent Termamyl-like a-araylase, variants of the 
parent Termamyl-like a-amylase which exhibit altered properties 
relative to the parent. 

5 BRIEF DISCLOSURE OF THE INVENTION 

The present invention relates to novel a-amylolytic 
variants (mutants) of a Termamyl-like a-amylase, m particular 
variants exhibiting increased stability at acidic pH at high 
temperatures (relative to the parent) which are advantageous m 
10 connection with, e.g., the industrial processing of starch 
(starch liquefaction, saccharif ication and the like) as 
described in US Patent No. 3,912,590 and SP patent publications 
Nos. 252,730 and 63,909. 

15 Starch conversion 

A "traditional" starch conversion process degrading starch 
to lower molecular weight carbohydrate components such as 
sugars or fat replacers includes a debranching step. 

20 ^^Starch to sugar" conversion 

In the case of converting starch into a sugar the starch is 
depolymerized . A such depolymerization process consists of a 
pretreatment step and two or three consecutive process steps, 
viz- a liquefaction process, a saccharif ication process and 
25 dependent on the desired end product optionally an 
ieomerizat ion process, 

Pre-treatment of native starch 

Native starch consists of microscopic granules which are 

30 insoluble in water at room temperature. When an aqueous starch 
slurry is heated, the granules swell and eventually burst, 
dispersing the starch molecules into the solution. During this 
'"gelatinization" process there is a dramatic increase in 
viscosity. As the solids level is 30-40% in a typically 

35 industrial process, the starch has to be thinned or "^liquefied" 
so that it can be handled. This reduction in viscosity is today 
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mostly obtained by enzymatic degradation. 
Liquefaction 

During the liquefaction step, the long chained starch is 
5 degraded into branched and linear shorter units (maltodextrins) 
by an a-amylase (e.g., Termanayl™ SEQ ID NO; 4 herein). The 
liquefaction process is carried out at 105-110°C for 5 to 10 
minutes followed by 1-2 hours at 95*C. The pH lies between 5.5 
and 6-2, In order to ensure an optimal enzyme stability under 
10 these conditions, 1 mM of calcium is added (40 ppm free calcium 
ions) . After this treatment the liquefied starch will have a 
'"dextrose equivalent" (DE) of 10-15. 

Saccharif ication 

15 After the liquefaction process the maltodextrins are 

converted into dextrose by addition of a glucoamylase (e.g., 
AMG™) and a debranching enzyme, such as an isoamylase (US 
Patent 4,335,208) or a pullulanase {^.g., Promozyme™) (US 
Patent 

20 4/560,651) . Before this step the pH is reduced to a value below 
4.5, maintaining the high temperature (above 95°C) to 

inactivate the liquefying a-amylase to reduce the formation of 
short oligosaccharide called '^panose precursors" which cannot 
be hydrolyzed properly by the debranching enzyme. 
25 The temperature is lowered to 60*^0, and glucoamylase and 

debranching enzyme are added. The saccharif ication process 
proceeds for 24-72 hours. 

Normally, when denaturing the a-amylase after the 
liquefaction step about 0.2-0.5% of the saccharif ication 
30 product is the branched trisaccharide S"-a-glucosyl maltose 
(panose) which cannot be degraded by a pullulanase. If active 
amylase from the liquefaction step is present during 
saccharif ication (i.e., no denaturing), this level can be as 
high as 1-2%, which is highly undesirable as it lowers the 
35 saccharif ication yield significantly. 
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IsoKierizatlon 

When the desired final sugar product is e.g, high fructose 
syrup the dextrose syrup may be converted into fructose, 
5 After the saccharif ication process the pH is increased to a 
value in the range of 6-6, preferably pH 7.5, and the calcium 
is removed by ion exchange. The dextrose syrup is then 
converted into high fructose syrup using, &.g., an immmobilised 
glucoseisomerase (such as Sweet zyme''^) . 

.0 

In the context of the invention the term "acidic pH'' means a 
pH below 7.0, especially below the pH range in which industrial 
starch liquefaction processes are traditionally performed, as 
described above, which is between pH 5 . 5 and €.2. 
15 In the context of the present invention the term ^^low 

f'^ Calcium concentration^ means concentrations below the normal 
level used in traditional industrial starch liquefaction 
■^=4 processes, such as between Q-40 ppm, preferably between 10-30 
■;\ ppm, such as between 15-25 ppm Calcium, Normal concentrations 
\^2o vary depending of the concentration of free Ca^"" in rhe corn. 
Normally a dosage corresponding to ImM (4 0ppm) is added which 
together with the level in corn gives between 4 0 and BO ppm free 

In the context of the invention the term ^^high temperature" 
25 means temperatures between 95 and I60*C, especially the 
temperature range in which industrial starch liquefaction 
processes are normally performed, which is between 95 and 105°C. 

The invention further relates to DNA constructs encoding 
30 variants of the invention, to methods for preparing variants of 
the invention, and to the use of variants of the invention, 
alone or in combination with other a-amylolytic enzymes, in 
various industrial processes, in particular starch liquefaction. 

35 Nomenclature 

In the present description and claims, the conventional one- 
letter and three- letter codes for amino acid residues are used. 
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For ease of reference^ a-amylase variants of the invention are 
described by use of th.e following nomenclature: 
Original amino acid (s) :position (s) : substituted amino acid(s) 

According to this nomenclature, for instance the 
5 substitution of alanine for asparagine in position 3 0 is shown 
as : 

Ala3 0Asn or A3 ON 

a deletion of alanine in the same position is shown as: 
Ala30* or A30* 

10 and insertion of an additional amino acid residue, such as 
lysine / is shown as: 

Ala3 0AlaLys or A3 OAK 

A deletion of a consecutive stretch of amino acid residues, 
such as amino acid residues 30-33, is indicated as (30-33)* or 
- 15 A(A30-N33) - 

Where a specific a-amylase contains a "deletion" in 

comparison with other a-amylases and an insertion is made in 
such a position this is indicated as: 
*3 6Asp or ^3 6D 
20 for insertion of an aspartic acid in position 3 6 

Multiple mutations are separated by plus signs, i.e.: 

Ala30Aap + Glu34Ser or A30N+E343 

representing mutations in positions 30 and 34 substituting 
alanine and glutamic acid for asparagine and serine, 
25 respectively. Multiple mutation may also be separated as 
follows, i.e., meaning the same as the plus sign: 
Ala3 0Asp/Glu3 4Ser or A3 0N/E34S 

When one or more alternative ammo acid residues may be 
inserted in a given position it is indicated as 
30 A3 0N,E or 

A3 ON or A3 0E 

Furthermore, when a position suitable for modification is 
identified herein without any specific modification being 
suggested, it is zo be understood that any amino acxd residue 
35 may be substituted for the amino acid residue present in the 
position. Thus, for instance, when a modification of an alanine 



15. NOV. 1999 1 7:44 NOVO N0R3ISK PAT 3P 44493256 

S7Q9.204-W0 



NO. 2733 P. 8 



6 

in position 3 0 is mentioned, but not specified, it is to be 
understood that the alanine may be deleted or substituted for 
any other amino acid, i.e., any one of: 
R,N,D,A.C,Q, E,G,H, I,L,K,M, F,P,S,T, Y, V. 

5 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 is an alignment of the amino acid sequences of six 

parent Termamyl -like a-amylases in the context of the invention. 

The numbers on the Extreme left designate the respective amino 
10 acid sequences as follows: 

1 : SEQ ID NO ; 2 , 

2 ; amylase 

3 : SEQ ID NO : 1 , 

4 : SEQ ID NO: 5, 
15 5t SEQ ID NO: 4, 

6 : SEQ ID NO : 3 . 

Figure 2 shows the PCR strategy used in Example 1 - 

20 DETAIIiED DISCLOSURE OF THE INVENTION 

The Termamyl - like a-amyla3_e 

It is well known that a number of a-amylases produced by 
Beicillus app. are highly homologous on the amino acid level. For 
instance, the B. lichenltormis a-amylase comprising the amino 

25 acid sequence shown in SEQ ID NO ; 4 (commercially available as 
Termamyl™) has been found to be about 6 9% homologous with the 
amyloliquofaalens a-amylase comprising the amino acid 
sequence shown in SEQ ID NO: 5 and about 79% homologous with the 
steajrotihejrmophilus a-amylase comprising the amino acid 

30 sequence shown in SEQ ID NO; 3- Further homologous a-amylases 
include an a-amylase derived from a strain of the Bacillus sp . 
NCIB 12289, NCIB 12512, NCIB 12513 or DSM 9375, all of which are 
described in detail in WO 95/26397, and the a-amylase described 
by Tsukamoto et al . , Biochemical and Biophysical Research 

35 Communications . 151 (1988), pp. 25-31. 
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Still further homologous a- amylases include the a- amylase 
produced by the B. llcheniformis strain described in EP 025266:6 
(ATCC 27811) , and the a-amylases identified in WO 91/00353 and 
WO 94/18314- Other commercial Termamyl-like B. llcheniformis 
5 a-amylases are Optitherm™ and Takatherm*^^ (available from 
Solvay) , Maxamyl'^^ (available from Gist -brocades/Genencor ) , 
Spezym AA™ and Spezyme Delta AA'" (available from Genencor) , and 
Keistase™ (available from Daiwa) . 

Because of the substantial homology found between these a- 
10 amylases, they are considered to belong to the same class of a- 
amylaseS; namely the class of "Termamyl-like a-amylases". 

Accordingly, in the present context, the term "Termamyl-like 
a-amylase" is intended to indicate an a-amylase which, at the 
amino acid levels exhibits a substantial homology to Termamyl™, 
15 i.e., the B, llcheniformis a-amylase having the amino acid 
sequence shown in SEQ ID NO: 4 herein. In other words, a 
Termamyl-like a-amylase is an a-amylase which has the amino 
acid sequence shown in SEQ ID NOS : 1, 2, 3, 4, 5, 6, 7 or S 
herein, and the amino acid sequence shown in SEQ ID NO: 1 of WO 
20 95/263 97 (the same as the amino acid sequence shown as SEQ ID 
NO: 7 herein) or in SEQ ID NO: 2 of WO 95/2 63 97 (the same as the 
amino acid sequence shown as SEQ ID NO: 8 herein) or m 
Tsukamoto et al . , 1988, (which amino acid sequence is shown in 
SEQ ID NO: 6 herein) or i) which displays at least 60% homology 
25 (identity) , preferred at least 70%, more preferred at least 75%, 
even more preferred at least 80%, especially at least SS%, 
especially preferred at least 90%, especially at least 95%, even 
especially more preferred at least 97%, especially at least 99% 
homology with at least one of said amino acid sequences shown in 
30 SEQ ID NOS 1: or 2 or 3 or 4 or 5 or 6 or 7 or 8 and/or ii) 
displays immunological cross-reactivity with an antibody raised 
against one or more of said a-amylases, and/or iii) is encoded 
by a DNA sequence which hybridizes, under the low to very high 
stringency conditions (said conditions described below) to the 
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DNA sequences encoding the above -specified a-arnylases which are 
apparent from SEQ ID NOS : 9, 10, 11, 12, and 22, respectively, 
of the present application (which encodes the amino acid 
sequences shown in SEQ ID NOS; 1, 2, 3, 4, and 5 herein, 
5 respectively), from SEQ ID NO: 4 of WO 95/26397 (which DNA 
sequence, together with the stop codon TAA, is shown in SEQ ID 
NO: 13 herein and encodes the amino acid sequence shown in SEQ 
ID NO: 8 herein) and from SEQ ID NO: 5 of WO 95/2 63 97 (shown in 
SEQ ID NO: 14 herein) , respectively. 
10 In connection with property i) , the "homology" (identity) 

may be determined by use of any conventional algorithm, 
preferably by use of the gap progamme from the GCG package 
version 8 (August 1994) usxng default values for gap penalties, 
i.e., a gap creation penalty of 3 . 0 and gap extension penalty of 
15 0,1 (Genetic Computer Group (1991) Programme Manual for the GCG 
Package, version 8, 575 Science Drive, Madison, Wisconsin, USA 
53711) , 

The parent Termamyl-like a-amylase backbone may in an 
embodiment have an amino acid sequence which has a degree of 
20 identity to SEQ ID NO: 4 of ar least 65%, preferably at least 
70%, preferably at least 75%, more preferably at least 80%, more 
preferably at least 85%, even more preferably at least about 
90%, even more preferably at least 95%, even more preferably at 
least 97%, and even more preferably at least 9 9% identity 
25 determined as described above 

A structural alignment between Termamyl® (SEQ ID NO: 4) and 
a Termamyl-like a-amylase may be used to identify 
equivalent /corresponding positions in other Termamyl-like a- 
amylases . One method of obtaining said structural alignment is 
3 0 to use the Pile Up programme from the GCG package using default 
values of gap penalties, i.e., a gap creation penalty of 3 . 0 and 
gap extension penalty of 0,1. Other structural alignment methods 
include the hydrophobic cluster analysis (Gaboriaud et al , , 
(1987), FEBS LETTERS 224, pp. 149-155) and reverse threading 
35 (Huber, T / Torda, AE, PROTEIN SCIENCE Vol. 7, No. 1 pp. 142- 
149 (1998) , 
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For example, the corresponding positions, of target residues 
found in the C-domain of th^ B, llcheniforwis a-amylase, in the 
amino acid sequences of a number of Termamyl -like a-amylases 
which have already been mentioned are as follows: 



Termamyl-like a-amylase 



10 



B, llch. (SEQ ID NO: 4) 
B. amylo. (SEQ ID NO: 5) 
B. atearo, (SEQ ID NO: 3) 
Sac.WO 95/2S397 (SEQ ID NO: 2) 
Sac.WO 95/26397 (SEQ ID NO: 1) 



Y358 E376 S417 A420 
S256 Y35a E376 S417 A420 

Y361 

Y363 S419 

Y3S3 



15 



20 



As will be described further below murations of these 
conserved amino acid residues are very important in relation to 
increasing the stability at acidic pH and/or at low calcium 
concentration at high temperatures. 

Property ii) (see above) of the a-amylase, i.e., the 
immunological cross reactivity, may be assayed using an antibody 
raised against, or reactive with, at least one epitope of the 
relevant Termamyl-like a-amylase. The antibody, which may either 
25 be monoclonal or polyclonal, may be produced by methods known in 
the art, e.g,, as described by Hudson et al . , Practical 
Immunology, Third edition (1989), Blackwell Scientific 
Publications. The immunological cross-reactivity may be determi- 
ned using assays known in the art, examples of which are Western 
Blotting or radial immunodiffusion assay, e.g., as described by 
Hudson et al . , 1989. In this respect, immunological cross- 
reactivity between the a-amylases having the amino acid 
sequences SEQ ID NOS: 1, 2, 3, 4, 5, 6. 7 , or 8 respectively, 
have been found. 

The oligonucleotide probe used in the characterization of 
the Termamyl-like a-amylase in accordance with property iii) 



30 



35 
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above may suitably be prepared on the basis of the full or 
partial nucleotide or amzno acid sequence of the a-amylase in 
question . 

Sua table conditions for testing hybridization in^rolve 
5 presoaking in SxSSC and prehybridising for 1 hour at -40 °C in a 
solution of 20% formamide, SxDenhardt ' s solution, BOmH sodium 
phosphate, pH 6.3, and 50mg of denatured sonicated calf thymus 
DNA, followed by hybridization in the same solution supplemented 
with lOOmM ATP for 18 hours at -40=*C, followed by three times 
10 washing of the filter in 2xSSC, 0.2% SDS at 40°C for 30 minutes 
(low stringency) , preferz-ed at 50*C (medium stringency) , more 
preferably at eS^'C (high stringency} , even more preferably at 
-75^C (very high stringency) . More details about the 
hybridization method can be found in Sambrook et al . , Molecular 
J 15 Cloning: A Laboratory Manual , 2nd Ed,, Cold Spring Harbor, 1989. 
- In the present context, "derived from" is intended not only 

to indicate an a-amylase produced or producible by a strain of 
^ the organism in question, but also an a-amylase encoded by a DNA 
^ sequence isolated from such strain and produced in a host or- 
1; 20 ganism transformed with said DNA sequence. Finally, nhe term is 
intended to indicate an a-amylase which is encoded by a DNA 
sequence of synthetic and/or cDNA origin and which has the 
identifying characteristics of the a-amylase in question. The 
term is also intended to indicate that the parent a-amylase may 
2S be a variant of a naturally occurring a-amylase, i.e., a variant 
which is the result of a modification (insertion, substitution, 
deletion) of one or more ammo acid residues of the naturally 
occurring a-amylase. 

3 0 Parent hybrid a- amylases 

The parent a- amylase (backbone) may be a hybrid a-amylase, 
i^e., an a-amylase which comprises a combination of partial 
amino acid sequences derived from at least two a-amylases. 

The parent hybrid a-amylase may be one which on the basis of 
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amino acid homology and/or immunological cross -reactivity and/or 
DNA hybridization (as defined above) can be determined to belong 
to the Termamyl-like a-amylase family. In this case, the hybrid 
a-amylase is typically composed of at least one part of a 
5 Termamyl-like a-amylase and part(s) of one or more other a- 
amylases selected from Termamyl-like a-amylases or non-Termamyl- 

like a-amylases of microbial (bacterial or fungal) and/or 
mammalian origin. 

ThuS/ the parent hybrid a-amylase may comprise a combination 
10 of partial amino acid sequences deriving from at least two 
Termamyl-like a-amylases, or from at least one Termamyl-like and 
at least one non-Termamyl-like bacterial a-amylase, or from at 
least one Termamyl-like and at least one fungal a-amylase. The 
Termamyl-like a-amylase from which a partial amino acid sequence 
I'^is derives may, e.g., be any of those specific Termamyl-like aa- 
'^'^ myiase referred to herein. 

For instance / the parent a-amylase may comprise a C-terminal 
f7 part of an a-amylase derived from a strain of B. lichenlformis ^ 
ill; and a N-terminal part of an a-amylase derived from a strain of 
^"J20 S, amyloliquefaclGns or from a strain of B, stea.2rothBrmophllus . 
For instance, the parent a-amylase may comprise at least 43 0 
amino acid residues of the C-terminal part of the B. 
lichenlfojrmls a-amylase. A such hybrid Termamyl-like a-amylase 
may be identical to the Ba^cillus lichenlformls a-amylase shown 
25 in SEQ ID NO: 4, except that the N-termmal 35 ammo acid 
residues (of the mature protein) is replaced with the N-terminal 
33 amino acid residues of the mature protein of the Ba.clllus 
amylaliquefaclens a-amylase (BAN) shown in SEQ ID NO: 5. A such 
hybrid may also consist of an amino acid segment corresponding 
30 to the 68 N-terminal amino acid residues of the B, stearotrher- 
mophllus a-amylase having the amino acid sequence shown in SEQ 
ID NO: 3 and an amino acid segment corresponding to the 415 C- 
terminal amino acid residues of the B. lichenifojrmis a-amylase 
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having the amino acid sequence shown in SEQ ID NO: 4. 

The non-Termamyl-like a-amylase may/ e.g., be a fungal a- 
amylase, a TnamTnalian 02: a plant a-amylase or a bacterial a- 
amylase (different from a Termamyl-like a-amylase) . Specific 
5 examples of such a-amylases include the Asperg-illus oryzae tAKA 
a-amylase, the A. niger acid a-amylase, the Ba^cxllus subtilis 
a-amylase, the porcine pancreatic a-amylase and a barley a- 
amylase. All of these a-amylases have elucidated structures 
which are markedly different from the structure of a typical 
10 Termamyl-like a-amylase as referred to herein. 
Q The fungal a-amylases mentioned above, i.e. derived from A> 

niger and A, oryzae ^ are highly homologous on the amino acid 
level and generally considered to belong to the same family of 
a-amylases. The fungal a-amylase derived from Aspergillus 
kihiB oryzae is commercially available under the tradename Fungamy 1*^'^ , 

Furthermore, when a particular variant of a Termamyl-like a- 
1;.^ amylase (variant of the invention) is referred to - in a 
1"^ conventional manner - by reference to modification (e.gr,, 
JIT deletion or substitution) of specific amino acid residues in the 
b:p20 amino acid sequence of a specific Termamyl-like a-amylase, it is 
to be understood that variants of another Termamyl-like a- 
amylase modified in the equivalent position (s) {as determined 
from the best possible amino acid sequence alignment between the 
respective amino acid sequences) are encompassed thereby, 
25 A preferred embodiment of a variant of the invention is one 

derived from a B. licheniformis a-amylase (as parent Termamyl- 
like a-amylase), e.g., one of those referred to above, such as 
the B. licheniformis a-amylase having the amino acid sequence 
shown in SBQ id NO : 4 , 

30 

Altered properties of variants of the invention 

The following discusses the relationship between 
alterations/mutations which may be present in variants of the 
invention, and desirable alterations in properties (relative to 
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those a parent, Termamyl-like a- amylase) which may result 
therefrom. 

Increased stability at acidic oH and/or low calcium 
5 concentration at high temperatures 

The present invention relates to a variant of a parent 
Termamyl-like a-amylase, which variant a-amylase has been 
altered xn comparison to the parent a-amylase in one or more 
solvent exposed amino acid residues on the surface of the a- 
10 amylase to increase the overall hydrophibicity of the a-amylase 
and/or to increase the overall numbers of methyl groups m the 
sidechains of said solvent exposed amino acid residues on the 
surface . 

In a preferred embodiment one or more solvent exposed amino 
"J acid residues on a concaiv surface wxth inwards bend are altered 
to more hydrophobic amino acid residues. 

In another preferred embodim.ent one or more solvent exposed 
amino acid residues on a convex surface are altered to increase 
the number of methyl groups in the sidechain, 
20 The present invention relates to an a-amylase variant of a 

parent Termamyl-like a-amylase, comprising an alteration at one 
or more positions selected from the group of: 
E376, S417, A420, S356, Y35S; 

wherein (a) the alteration (s) are independently 
25 (i) an insertion of an ammo acid downstream of the amino 

acid which occ-upies the position, 

(ii) a deletion of the ammo acid which occupies the 
position, or 

(iii) a substitution of the amino acid which occupies the 
30 position with a different amino acid. 

(b) the variant has a-amylase activity and (c) each position 
corresponds to a position of the amino acid sequence of the 
parent Termamyl-like a-amylase having the amino acid sequence of 
SEQ ID NO; 4. 

35 In an embodiment the alteration is one of the following 
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substitutions : 

B3 76A,R,D,C,Q,G,H,I,K,L,M,N,F,P, S,T,W, Y,V. 
In a preferred embodiment the substitution is: E376K, 

In an embodiment the alteration ia one of the following 
s substitutions : S417A,R,r),C.E,Q,G,H, I . K, L, M,N; F ; W, Y, V; 

In a preferred embodiment the substitution is S417T. 

In an embodiment the alteration is one of the following 
substitutions A42 0R,D,C,E,Q,G,K, I,K,L,M,N,F,P,S;T,W,Y,V7 
In a preferred embodiment the substitution is: A420Q,R. 
10 In an embodiment the alteration is one of the following 

substitutions ; S3 5 6A, R, D, C, E, Q, G, H, I , K, L , M, N, F , P. Y, V . 

In an embodiment the alteration is one of the following 
substitutions Y3 58A,R,D,C,E,Q,G,H, I , K, M.N, S, T, V. 
In a preferred embodiment the substitution is Y358F. 
15 In an embodiment of the invention a variant comprises one or 

more of the following substitutions: E376K, S417T, A420Q,R, 
S35SA, Y358F. 

The increase in stability at acidic pH and/or low calcium 
concentration at high temperatures may be determined using the 
1^^^ 20 method described below in Example 2 illustrating the invention. 
i^^ The parent Termamyl-like a-amylase used as the backbone for 

preparing variants of the invention may be any Termamyl-like a- 
^1 amylases as defined above. 

Specifically contemplated are parent Termamyl-like a- 
25 amylases selected from the group derived from B. lichenifarmis, 
such as B. liahenifoxmis strain ATCC 27811^ amyloli- 
quefa^ci^nB, stea2rot±termophiluSr Bacillus sp. NCIB 12289, NCIE 
12512, NCIB 12513 or DSM 9375, and the parent Termamyl-like a- 
amylases depicted in SEQ ID NOS : 1, 2, 3, 4, 5, 6, 7 and 
3 0 In an embodiment of the invention the parent Termamyl-like 

a-amylase is a hybrid a-amylase being identical to the BB.cillus 
lxcheni£oTm±s a-amylase shown in SEQ ID NO; 4 (Termamyl) , except 
that the N-terminal 35 amino acid residues (of the mature 
protein) is replaced with the N-terminal 33 amino acid residues 
35 of the mature protein of the Ba,a±llus amyloliquefaclens a- 
amylase (BAN) shown in SEQ ID NO: 5. The parent Termamyl-like 
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hybrid a-amylase may be the above mentioned hybrid Termamyl-like 
a-amylase which further has the following mutations; 
H156Y+181T+190F+209V+264S (using the nuinbering m SEQ ID NO: 4) , 
Said backbone is referred to below as "LE174" , 
s The parent a-amylase may advantageously further have a 

mutation in one or more of the following positions: K176, 1201 
and H205 (using the numbering in SEQ ID NO: 4), especially one 
or more the followxng substitutions: K176R, 12 OIF, and H2 05N 
(using the numbering in SEQ ID NO: 4) , such as specifically the 
10 following substitutions: k:176R+I201F+H2 0 5N (using the numbering 
in SEQ ID NO; 4) , 

The inventors have found that the above mentioned variants 
have increased stability at pHs below 7.0 (i,e-, acidic pH) 
and/or at calcium concentration below ImM (40ppm) (i.e, low 
15 calcium concentrations) at temperatures xn nhe range from 95 to 
160''C (i.e., high temperatures) relative to the parent 
Termamyl-like a-amylase. 

Alterations (e.g., by substitution) of one or more solvent 
exposed amino acid residues which 1) increase the overall 
hydrophobicity of the enzyme, or 2) increase the number of 
methyl groups in the sidechains of the solvent exposed amino 
acid residues improve the temperature stability. It is preferred 
to alter (e.g,, by substitution) to m.ore hydrophobic residues on 
a concav surface with inwards bend. On a convex surface ^ 
25 alterations (e.g., by substitution) to amino acid residues with 
an increased number of methyl groups in the sidechain are 
preferred. 

Using the program CAST found on the internet at 
http://sunrise.cbs-umn.edu/cast/ version 1.0 (release Feb, 

30 1998) , (reference : Jie Liang, Herbert Edelsbrunner , and Clare 
Woodward. 1998, Anatomy of protein Pockets and Cavities: 
Measurements of binding site geometry and implications for 
ligand design. Protein Science, 7, pp, 1884-1S97) , a concave 
area which access to the surface can be identified. Access to 

35 the surface is in the program defined as a probe with a diameter 
of 1.4A can pass in and out. Using default parameters in the 



20 
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CAST prog^ram cancave cavities can be found using the Calcium 
depleted alpha- amylase structure from B, licheniformls as found 
in the Brookhaven database (IBPL) : 

Three types of interaction can be rationalised: 
5 A. Interaction between the sidechain of the residue and the 
protein, 

B- Interaction between the sidechain of the residue and the 
surrounding water, 

C. Interaction between the water and the protein. 
10 Using the parent Termamyl-like a- amylase shown in SEQ ID NO: 

4 as the backbone the following positions are considered to be 
solvent exposed and may suitably be altered: 
E376, S417, A420, S356, Y358 , 

Corresponding and other solvent exposed positions on the 
^5 surface of other Termamyl-like a-amylase may be identified using 
the dssp program by W. Kabsch and C, Sander, Biopolymers 22 
(1983) pp. 2577-2637. The convex surfaces can be identified 
using the the AACAVI program part from the WHATIF package (G. 
Vriend, Whatxf and drug design program. J. Mol . Graph. 8, pp. 
""20 52-56. (1590) version 19960317). 

p In an embodiment; of the invention a variant comprises one or 

0 more of the following substitutions: E376K, S417T, A420Q,R; 
3 S356A, Y358P- 

The inventors have found that the stabxlity at acidic pK 
25 and/or low calcium concentration at high temperatures may be 
increased even more by combining mutations in the above 
mentioned positions, i.e,, E375, S417, A420, S356, Y358, (using 
the SEQ ID NO: 4 numbering) with mutations in one or more of 
positions K17 6, 1201, and H2 05. 
30 The following additional substitutions are preferred: 

K17 6A,R,D,C, S,Q,G,H, I,L,M,N,F,P, S,T,W, Y, V; 
I2 01A,R,D, C,E,Q,G,H,L,K,M,N,F, P,S,T, W, V; 
H2 05A,R,D,C, E,Q,G, I,L,K,M,lNr, F, P, S, T,W, Y, V; 

As also shown in Example 2 illustrating the invention 
35 combining the following mutations give increased stability: 
K176+I201F+H205N+E376K+A420R or 
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K176+I201F+H205N+S417T+A420Q or 

K176+I201F4-H205N+S356A+Y358F using the hybrid a-amylase referred 
to as LEI 74 as the parent Termamyl-like a-amylase. 

5 General mutat ions in variants of the invention 

It may be preferred that a variant of rhe invention 
comprises one or more modifications in addition to those 
outlined above. Thus, ir may be advantageous that one or more 
proline residues present in the part of the a-amylase variant 
10 whach is modified is/are replaced with a non-proline residue 
which may be any of the possible, naturally occurring non- 
proline residues, and v/hich preferably is an alanine, glycine ^ 
serine, threonine, valine or leucine. 

Analogously, it may be preferred that one or more cysteine 
15 residues present among the amino acid residues with which the 
parent a-amylase is modified is/are replaced with a non-cysteine 
residue such as serine, alanine, threonine, glycine, valine or 
leucine . 

Furthermore, a variant of the invention may - either as the 
20 only modification or in combination with any of the above 
outlined modifications - be modified so that one or more Asp 
and/or Glu present in an ammo acid fragment corresponding to 
the ammo acid fragment 185-209 of SEQ ID NO: 4 is replaced by 
an Asn and/or Gin, respectively. Also of interest is the 
25 replacement, m the Termamyl-like a-amylase, of one or more of 
the Lys residues present in an amino acid fragment corresponding 
to the amino acid fragment 1S5-209 of SEQ ID NO : 4 by an Arg. 

It will be understood that the present invention encompasses 
variants incorporating two or more of the above outlined 
30 modifications . 

Furthermore. it may be advantageous to introduce point - 
mutations in any of the variants described herein- 
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Cloning a DNA sequence encoding an a-amylaae of the invention 

The DNA sequence encoding a parent a-anrylase may be isolated 
£rcm any cell or microorganism producing the a-amylase in 
question, using various methods well known in the art. First, a 
5 genomic DNA and/ or cDNA library should be constructed using 
chromosomal DNA or messenger RNA from the organism that produces 
the a- amylase to be studied. Then, if the amino acid sequence of 
the a-amylase is known, homologous, labelled oligonucleotide 

probes may be synthesized and used to identify a-amylase- 
10 encoding clones from a genomic library prepared from the 
,,,, organism in question. Alternatively, a labelled oligonucleotide 
0 probe containing sequences homologous to a known a-amylase gene 
F.:; could be used as a probe to identify a- amylase -encoding clones, 
using hybridization and washing conditions of lower stringency. 
IS Yet another method for identifying a -amylase -encoding clones 

i| would involve inserting fragments of genomic DNA into an ex- 
pression vector, such as a plasmid, transforming a- amylase - 
„:^!. negative bacteria with the resulting genomic DNA librairy, and 
'J then plating the transformed bacteria onto agar containing a 
|j 20 substrate for a-amylase, thereby allowing clones expressing che 
a-amylase to be identified. 

Alternatively, the DNA sequence encoding the enzyme may be 
prepared synthetically by established standard methods, e,g, the 
phosphoroamidite method described by S.L. Beaucage and M.H. 
25 Caruthers (1981) or the method described by Matthes et al . 
(1984) . In the phosphoroamidite method, oligonucleotides are 
synthesized, e*g-, in an automatic DNA synthesizer, purified, 
annealed, ligated and cloned in appropriate vectors . 

Finally, the DNA sequence may be of mixed genomic and syn- 
30 thetic origin, mixed synthetic and cDNA origin or mixed genomic 
and cDNA origin, prepared by ligating fragments of synthetic, 
genomic or cDNA origin (as appropriate, the fragments 
corresponding to various parts of the entire DNA sequence) , in 
accordance with standard techniques. The DNA sequence may also 
35 be prepared by polymerase chain reaction (PGR) using specific 
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primers, for instance ais described in US 4,683,202 or R.K. Saiki 
et al. (1938) . 

Site-directed mutagenesi s 
5 Once an a -amylase -encoding DNA sequence has been isolated, 
and desirable sites for mutation identified, mutations may be 
introduced using synthetic oligonucleotides. These oligonucleot- 
ides contain nucleotide sequences flanking the desired mutation 
sites; mutant nucleotides are inserted during oligonucleotide 

10 synthesis. In a specific method, a single -stranded gap of DISTA, 
bridging the a -amylase -encoding sequence^ is created in a vector 
carrying the a-amylase gene. Then the synthetic nucleotide, 
bearing the desired mutation^ is annealed to a homologous 
portion of the single-stranded DNA. The remaining gap is then 

15 filled in with DNA polymerase I (Klenow fragment) and the 
construct is ligated using T4 ligase . A specific example of this 
method is described in Morinaga et al , (1984). US 4,760,025 
discloses the introduction of oligonucleotides encoding multiple 
mutations by performing minor alterations of the cassette^ How- 

20 ever, an even greater variety of mutations can be introduced at 
any one time by the Honnaga method, because a multitude of 
oligonucleotides, of various lengths, can be introduced- 

Another method for introducing mutations into a-amylase- 
encoding DNA sequences is described in Nelson and Long (1989) . 

25 It involves the 3 -step generation of a PGR fragment containing 
the desired mutation introduced by using a chemically syn- 
thesized DNA strand as one of the primers in the PCR reactions. 
From the PCR-generated fragment , a DNA fragment carrying the 
mutation may be isolated by cleavage with restriction 

30 endonucleases and reinserted into an expression plasmid. 
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Random MutaQenesis 

Random m-at agenesis is suitably performed either as localised 
or region- specif ic random mutagenesis in at least three parts of 
the gene translating to the amino acxd sequence shown in 
s question, or within the whole gene. 

The random mutagenesis of a DNA sequence encoding a parent 
a-amylase may be conveniently performed by use of any method 
known in the art , 

In relation to the above, a further aspect of the present 
10 invention relates to a method for generating a variant of a 
parent a-amylase, e.g., wherein the variant exhibits altered or 
increased thermal stability relative to the parent, the method 
comprising : 

(a) subjecting a DNA sequence encoding the parent a- 
is amylase to random mutagenesis, 

(b) expressing the mutated DNA sequence obtained in step 
(a) in a host cell, and 

(c) screening fox- host cells expressing an a-amylase 
variant which has an altered property (i.e- thermal stability) 

20 relative to the parent a-amylase. 

Step (a) of the above method of the invention is preferably 
performed using doped primers. 

For instance, the random mutagenesis may be performed by use 
of a suitable physical or chemical mutagenizing agent, by use of 

25 a suitable oligonucleotide, or by subjecting the DNA seq[uence to 
PGR generated mutagenesis. Furthermore, the random mutagenesis 
may be performed by use of any combination of these mutagenizing 
agents- The mutagenizing agent may, e.g,, be one which induces 
transitions, transveraions , inversions, scrambling, deletions, 

3 0 and/or insertions- 
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Examples of a physical or chemical mutagenizing agent 
suitable for the present purpose include ultraviolet (UV) ir- 
radiation, hydroxyl amine, N-methyl-N' -nitro-N-nitrosoguanidine 
(MNNG) , O -methyl hydro:>cylamine , nitrous acid, ethyl methane 
5 sulphonate (EMS) , sodium bisulphite, formic acid, and nucleotide 
analogues. When such agents are used, the mutagenesis is 
typically performed by incubating the D3SIA sequence encoding the 
parent enzyme to be mutagenized in the presence of the 
mutagenizing agent of choice under suitable conditions for the 
10 mutagenesis to take place, and selecting for mutated DNA having 
the desir-ed properties. 

When the mutagenesis is performed by the use of an 
oligonucleotide, the oligonucleotide may be doped or spiked with 
the three non-parent nucleotides during the synthesis of the 
X5 oligonucleotide at the positions which are to be changed. The 
doping or spiking may be done so that codons for unwanted amino 
acids are avoided. The doped or spiked oligonucleotide can be 
incorporated into the DNA encoding the alpha-amylase enzyme by 
any published technique, using e.^. PGR, LCR or any DNA 
^20 polymerase and ligase as deemed appropriate. 

Preferably, the doping is carried out using "constant random 
doping", in which the percentage of wild-type and mutation in 
each position is predefined. Furthermore, the doping may be 
directed toward a preference for the introduction of certain 
25 nucleotides, and thereby a preference for the introduction of 
one or more specxfic amino acid residues. The doping may be 
made, e,gr. , so as to allow for the introduction of 90% wild type 
and 10% mutations in each position. An additional consideration 
in the choice of a doping scheme is based on genetic as well as 
30 protein-structural constraints. The doping scheme may be made 
by using the DOPE program which, inter alia, ensures that 
introduction of stop codons is avoided. 

When ?CR-generated mutagenesis is used, either a chemically 
treated or non-treated gene encoding a parent a-amylase is 
35 subjected to PGR under conditions that increase the mis- 
incorporation of nucleotides (Deshler 1992; Leung et al . , 
Technique, Vol , 1 , 1989, pp. 11-15). 
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A mutator strain of E, coll (Fowler et al . , Molec. Gen, 
Genet., 133, 1974, pp. 179-191), S. cereviseae or any other 
microbial organism may be used for the random mutagenesis of the 
DNA encoding the a-amylase by, transforming a plasmid 

5 containing the parent glycosylase into the mutator strain, 
growing the mutator strain with the plasmid and isolacing the 
mutated plasmid from the mutator strain. The mutated plasmid 
may be subsequently transformed into the expression organism. 

The DNA sequence to be mutagenised may be conveniently 
10 present m a genomic or cDNA library prepared from an organism 
expressing the parent alpha-amylase . Alternatively, the DNA 
sequence may be present on a suitable vector such as a plasmid 
;4 ^ bacteriophage, which as such may be incubated with or 

p; other-wise exposed to the mutagenising agent. The DNA to be 
^15 mutagenized may also be present in a host cell either by being 
j integrated in the genome of said cell or by being present on a 
vector harboured in the cell. Finally, the DNA to be mutagenized 
"r^ may be in isolated forrtu It will be understood that the DNA 
sequence to be subjected to random mutagenesis is preferably a 
„:,20 cDNA or a genomic DNA secjuence, 

Z. some cases it may be convenient to amplify the mutated 

]: DNA sequence prior to performing the expression step b) or the 
g screening step c) . Such amplification may be performed in 

accordance with methods known in the art, the presently 
25 preferred method being PCR-generated amplification using 

oligonucleotide primers prepared on the basis of the DNA or 

amino acid sequence of the parent enzyme. 
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Subsequent to the incubation with or exposure to the 
mutagenising agent, the mutated DNA is expressed by culturing a 
suitable host cell carrying the DNA sequence under conditions 
allowing expression to take place. The host cell used for this 
5 purpose may be one which has been transformed with the mutated 
DNA sequence, optionally present on a vector, or one which was 
carried the DNA sequence encoding the parent enzyme during the 
mutagenesis treatment. Examples of suitable host cells are the 
following: gram positive bacteria such as Baclllua subtiliSf 
10 Ba.cillus lichenlformiSj Bacillus lentus ^ Bacillus jbrevis, 
Bacillus 3teajrotherinophilus, Bacillus alkalophilus , Bacillus 
amyloliqijefaciens, Bacillus coagulans, Bacillus circulans , 
Bacillus lautus. Bacillus megaHerium, Bacillus thuringiensis , 
r;: St2reptomyc&s lividans or Strsptomyces iDurinus; and gram-negative 
•:'-^i5 bacteria such as E. coli. 

The mutated DNA sequence may further comprise a DNA sequence 
encoding functions permitting expression of the mutated DNA 
i-4 sequence . 

Localized random mutagenesis 

The random^ mutagenesis may be advantageously localized to a 
part of the parent a-amylaae in question. This may, e.g., be 
advantageous when certain regions of the enzyme have been 
identified to be of particular importance for a given property 

25 of the enzyme, and when modified are expected to result in a 
variant having improved properties. Such regions may normally be 
identified when the tertiary structure of the parent enzyme has 
been elucidated and related to the function of the enzyme. 

The localized, or region-specific, random mutagenesis is 

30 conveniently performed by use of PCR gener-ated mutagenesis 
techniques as described above or any other suitable technique 
known in the art. Alternatively, the DNA sequence encoding the 
part of the DNA sequence to be modified may be isolated, e.g-., 
by insertion into a suitable vector, and said part may be 

35 subsequently subjected "co mutagenesis by use of any of the 
mutagenesis methods discussed above. 
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Alternative met hods of providing a- amylase i/ariants 

Alternative methods for providing variants of the invention 
include gene shuffling method known in the art including the 
methods; e.^-., described in WO 95/22625 (from Affymax 
5 Technologies N.V.) and WO 96/00343 (from Novo Nordisk A/S) . 

Expression of a- amylase variants of the invention 

According to the invention, a DNA sequence encoding the 
variant produced by methods described above, or by any alterna- 
10 tive methods known in the art, can be expressed, in enzyme form, 
using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, 
1 translation initiation signal, and, optionally, a repressor gene 
or various activator genes. 
15 The recombinant expression vector carrying the DNA sequence 

4 encoding an a-amylase variant of the invention may be any vector 
which may conveniently be subjected to recombinant DNA 
procedures, and the choice of vector will often depend on the 
host cell into which it is to be introduced. Thus, the vector 
^^20 may be an autonomously replicating vector, i.e., a vector which 
r| exists as an extra chromosomal entity, the replication of which 
3 is independent of chromosomal replication, e,g., a plasmid, a 
bacteriophage or an extrachromosomal element, mini chromosome or 
an artificial chromosome. Alternatively, the vector may be one 
25 which, when introduced into a host cell, is integrated into the 
host cell genome and replicated together with the chromosome (s) 
into which it has been integrated. 

In the vector, the DNA sequence should be operably connected 
to a suitable promoter sequence. The promoter may be any DNA 
30 sequence which shows transcriptional activity in the host cell 
of choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell . Examples of 
suitable promoters for directing the transcription of the DNA 
sequence encoding an a-amylase variant of the invention, 
35 especially in a bacterial host, are the promoter of the lac 
operon of E.aoli, the Streptzomycos coellcolor agarase gene dagA 
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promoters, the promoters of the Bacillus llchenifozwis a-amylase 
gene (arayL) , the promoters of the Bacillus stearothermophllus 
maltogenic amylase gene (amyM) , the promoters of the Bacillus 
amyloliquGfaciens a-amylase {amyQ) , the promoters of the Sa- 
5 cillus subtzilis xylA and xylB genes etc. For transcription in a 
fungal host, examples of useful promoters are those derived from 
the gene encoding A. oryzae TAKA amylase, Rhizomucor miehei 
aspartic proteinase, A. niger neutral a-amylase, A. niger acid 
stable a-amylase, A. niger glucoamylase, Rhizomucor miehei 
10 lipase, A. oryzae alkaline protease. A, oryzae triose phosphate 
isomerase or A. nidulans acetamidase. 

The expression vector of the invention may also comprise a 
suitable transcription terminator and, in eukaryotes, poly- 
adenyiation sequences operably connected to the DNA sec[uence 
!^-^i5 encoding the a-amylase variant of the invention. Termination and 
polyadenylation sequences may suitably be derived from the same 
sources as the promoter. 

The vector may further comprise a DNA sequence enabling the 
vector to replicate in the host cell in question. Examples of 
^i,2o such sequences are the origins of replication of plasmids pUCl9, 
^ PACYC177, pUBllO, pE194 , pAMBl and pIJ702 . 

^ The vector may also comprise a selectable marker, e.g. a 

gene the product of which complements a defect in the host cell, 
such as the dal genes from B. subtilis or B. 1 1 cheni farmi s , or 

25 one which confers antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol or tetracyclin resistance. Fur- 
thermore, the vector may comprise Aspergillus selection markers 
such as amdS, argB, niaD and sC, a marker giving rise to 
hygromycin resistance, or the selection may be accomplished by 

30 CO -trans formation, e.g. as described in WO 91/17243. 

While intracellular expression may be advantageous in some 
respects, e.g. when using certain bacteria as host cells, it is 
generally preferred that the expression is extracellular. In 
general, the Bacillus a-amylases mentioned herein comprise a 

35 preregion permitting secretion of the expressed protease into 
the culture medium, if desirable, this preregion may be replaced 
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by a different preregion or signal sequence, conveniently accom- 
plished by substitution of the DNA sequences encoding the 
respective preregions . 

The procedures used to ligate the DNA construct of the 
5 invention encoding an a-amylase variant: . the promoter/ 
terminator and other elements, respectively, and to insert them 
xnto suitable vectors containing the information necessary for 
replication, are well known to persons skilled in the art (cf., 
for instance, Sambrook et al . , Molecular Clonincr: a Laboratory 
IQ Manual , 2nd Ed., Cold Spring Harbor, 1989) . 

The cell of the invention, either comprising a DNA construct 
or an expression vector of the invention as defined above ^ is 
p.. advantageously used as a host cell in the recombinant production 
vJ of an a-amylase variant of the invention. The cell may be 
:r;i5 transformed with the DNA construct of the invention encoding the 
variant, conveniently by integrating the DNA construct (in one 
■ or more copies) in the host chromosome. This integration is 
I := generally considered to be an advantage as the DNA sequence is 
- more likely to be stably maintained in the cell- Integration of 
;"'"2o the DNA constructs into the host chromosome may be performed 
according to conventional methods, e-g*. by homologous or 
1:1 heterologous recombination. Alternatively, the cell may be 
'^^f transformed with an expression vector as described above in 
connection with the different types of host cells, 
as The cell of the invention may be a cell of a higher organism 

such as a mammal or an insect, but is preferably a microbial 
cell, e.gr., a bacterial or a fungal (including yeast) cell. 

Examples of suitable bacteria are grampositive bacteria such 
as Bacillus snbtills, Bacillus llchsniformis. Bacillus lentus, 
iio Bacillus jbrevis. Bacillus? stearotzhermophilus , Ba.clllu3 alkalo- 
philU3, Bacillus amyloliquefaciens, Ba^cillus coagulans r Bacillus 
circulans. Bacillus lautus, Bacillus megaterium, Bacillus 
thuring-iensis , or Streptor/iyces lividana or Streptomyces murinus, 
or gramnegative bacteria such as E,coli. The transformation of 
35 the bacteria may, for instance, be effected by protoplast trans- 
formation or by using competent cells in a manner known per se. 
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The yeast organism raay favourably be selected from a species 
of Saccharomyces or Schizoaaccharomycesj e.g^, Sa.ccharorr\yces 
cerevislsie . The filamentous fungus may advantageously belong to 
a species of Aspergillus , e.^r, Aspergillus oryzae or Aspergillus 
5 niger. Fungal cells may be transfo2rmed by a process involving 
protoplast formation and transformation of the protoplasts fol- 
lowed by regeneration of the cell wail in a manner known per se. 
A suitable procedure for transformation of Aspergillus host 
cells is described xn EP 238 023. 
10 In a yet further aspect, the present invention relates to a 

method of producing an ot-amylaae variant of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the variant and 
recovering the variant from the cells and/or culture medium. 

The medium used to cultivate the cells may be any conven- 
tional medium suitable for growing the host cell in question and 
obtaining expression of the a-amylase variant of the invention. 
Suitable media are availcible from commercial suppliers or may be 
prepared according to published recipes (e.g-., as described m 
catalogues of the Americatn Type Culture Collection) . 

The a~amylase variant secreted from the host cells may con- 
j veniently be recovered from the culture medium by well-known 
i procedures, including separating the cells from the medium by 
centrif ugation or filtration, and precipitating proteinaceous 
25 components of the medium by means of a salt such as amraonium 
sulphate, followed by the use of chromatographic procedures such 
as ion exchange chromatography, affinity chromatography, or the 
like , 

3 0 Industrial Applications 

The a-amylase variants of this invention possesses valuable 
properties allowing for a variety of industrial applications. An 
enzyme variant of the invention are applicable as a component in 
washing, dishwashing and hard- surface cleaning detergent 

35 compositions. Numerous variants are particularly useful in the 
production of sweeteners and ethanol from starch, and/or for 
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textile desizing. Conditions for con^rentional starch- conversion 
processes, including starch liquefaction and/or saccharif ication 
processes y are described in, e.^r-, US 3,912,590 and in EP patent 
publications Nos . 2 52,73 0 and 63,909. 

s 

Production of sweeteners from starch : 

A "traditional" process for conversion of starch to fructose 
syrups norinally consists of three consecutive enzymatic 
processes, viz. a liquefaction process followed by a sacchari- 
10 f ication process and an isomerization process. During the 
liquefaction process, starch is degraded to dextrins by an a- 
amylase {e.g. Termamyl™) at pH values between 5.5 and 6,2 and at 
"I; temperatures of 95-160*'C for a period of approx. 2 hours. In 
■7; order to ensure an optimal enzyme stability under these condi- 
j-;i5 tions, 1 mM of calcium is added (40 ppm free calcium ions). 
^-■^ After the liquefaction process the dextrins are converted 

into dextrose by addition of a glucoamylase [e.g. AMG^"") and a 
::,| debranching enzyme, such as an isoamylase or a pullulanase (e.gr. 

Promozyme™) . Before this step the pH is reduced to a value below 
'^"'■^20 4.5, maintaining the high temperature (above 95°C) , and the 
liquefying a-amylase activity is denatured. The temperature is 
-J lowered to SO°C, and glucoamylase and debranching enzyme are ad- 
'™ ded. The saccharif ication process proceeds for 24-72 hours. 

After the saccharif ication. process the pH is increased to a 
25 value in the range of 6-8 , preferably pH 7.5^ and the calcium is 
removed by ion exchange. The dextrose syrup is then converted 
into high fructose syrup usinoy e.g-., an immmobxlized gluco- 
seisomerase (such as Sweetzyme'") . 

At least 1 enzymatic improvements of this process could be 
3 0 envisaged. Reduction of the calcium dependency of the 
liquefying a-amylase. Addition of free calcium is required to 
ensure adequately high stability of the a-amylase, but free 
calcium strongly inhibits the activity of the glucoseisomerase 
and needs to be removed, by means of an expensive unit 
3S operation, to an extent which reduces the level of free calcium 
to below 3-5 ppm. Cost savings could be obtained if such an 
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operation could be avoided and the Liquefaction process could be 
performed without addition of free calcium ions. 

To achieve thaty a less calcium- dependent Termamyl-like 
a-amylase which is stable and haghly active at low 
5 concentrations of free calcium (< 40 ppm) is required. Such a 
Termamyl-like a-amylase. should have a pH optimum at a pH m the 
range of 4,5-6.5, preferably in the range of 4.5-5.5. 

Detergent compos i t i ons 
10 As mentioned above, variants of the invention may suitably 

be incorporated in detergent compositions. Reference is made, 

for example, to RO 96/23 874 and WO 97/07202 for further details 
'^fr concerning relevant ingredients of detergent compositions (such 

as laundry or dishwashing detergents) , appropriate methods of 
15 formulating the variants in such detergent compositions, and for 
i:"" examples of relevant typtes of detergent compositions. 
'l^'i Detergent compositions comprising a variant of the invention 

hi may additionally comprise one or more other enzymes, such as a 
- lipase, cutinase, protease ^ cellulase, peroxidase or laccase, 
;:^,^2o and/or another a-amylase, 

"""J^ a-amylase variants of the invent xon may be incorporated in 

^ detergents at conventionally employed concentrations. It is at 
,^ present contemplated that a variant of the invention may be 
incorporated in an amount corresponding co 0.00001-1 mg 

25 (calculated as pure, active enzyme protein) of a-amylase per 
liter of wash/dishwash liquor using conventional dosing levels 
of detergent . 

MATERIALS AND METHODS 
30 Enzymes: 

LE174 hybrid alpha -amylase variant: LE174 is a hybrid Termamyl- 
like alpha-amylase being identical to the Termamyl sequence, 
i.e., the Bacillus licheniformis a-amylase shown in SEQ ID NO: 
4/ except that the N-terminal 35 amino acid residues (of the 
2S mature protein) has been replaced by the K-terminal 33 residues 
of BAN (mature protein), i.e., the Bacillus amylollqu&faciens 
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alpha -amylase shown in SEQ ID NO: 5, which further 
havefollowing mutations : H156Y+A181T4-lsri90F+A2 09V+Q264S (using 
the nuTDbering in SEQ ID NO: 4) . 

5 Construction of pSNKlOl 

This E. coli/ Bacillus shuttle vector can be used to 
introduce mutations without expression of a-amylase in coli 
and then be modified in such way that the a-amylase is active 
in Bacillus. The vector was constructed as follows: The a- 
10 amylase gene in the pX vector (pDN152 8 with the following 
alterations within amyL : BAN(l-33), K15 6Y , A131T, N190F, A209V, 
Q264S, the plasmid pDN1528 is further described in Example 1) 
\% was inactivated by interruption in the PstI site in the 
5 "coding region of the alpha-amylase gene by a 1,2 Icb fragment 
15 containing an E. call origin fragment. This fragment was 
amplified from the puciS (GenBank Accession #:X02514) using the 
forward primer 1: 5 "-gacctgcagtcaggcaacta-S ' (SEQ ID NO; 28) 
"^■--^ and the reverse primer 1: 5 ' -tagagtcgacGtgcaggcat-3 (SEQ ID 
'-j^ NOi 29), The PCR amplicon and the pX plasmid containing the a- 
amylase gene were digested with PstI at BV'C for 2 hours. The 
■■Y\ pX vector fragment and the E, coli origin amplicon were ligated 
-J at room temperature, for 1 hour and transformed m E. coli by 
'■"^ electrotransf ormation. The resulting vector is designated 
pSnKlOl . 

25 This E. coli/Ba.cillus shuttle vector can be used to 

introduce mutations without expression of a-amylase in B. coli 
and then be modified in such way that the a-amylase is active 
in Bacillus. The vector was constructed as follows: The a- 
amylase gene in the pX vector (pDN152 8 with the following 

30 alterations within amyL: 3A:sr(l-33) , 

H156Y-hAieiT-t-N190F+A209V+Q264S, the plasmid pDN1528 is further 
described in Example 1) was inactivated by interruption in the 
PstI site in the 5 'coding region of the alpha-amylase gene by a 
1.2 kb fragment containing an E. coll origin fragment. This 

35 fragment was amplified from the pUC19 (GenBank Accession 
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#:X02514) using the forward primer 2: 5 " -gacctgcagtcaggcaacta- 
3" (SEQ ID NO: 3 0) and the reverse primer 2: 5"- 
tagagtcgacctgcaggcat-3 ' (SEQ ID NO: 31) . The PGR amplicon and 
the pX plasmid containing the a- amylase gene were digested 
5 with PstI at 37°C for 2 hours. The pX vector fragment and the 
E. coll origin amplicon were ligated at room temperature- for 1 
hour and transformed in E. aali by electrotransf ormation , The 
resulting vector is designated pSnKlOl. 

10 Low pH filter assay 

Bs-aillixs libraries are plated on a sandwich of cellulose 
acetate (OE 61, Schleicher & Schuell, Dassel, Germany) - and 
nitrocellulose filters (Protran-Ba 85, Schleicher & Schuell, 
jf: Dassel, Germany) on TY agar plates with 10 fxg/ml 
15 chloramphenicol at 37*^0 for at least 21 hrs . The cellulose 
acetate layer is located on the TY agar plate. 

Each filter sandwich is specifically marked with a needle 
after plating, but before xncubatioh in order to be able to 
U localize positive variants on the filter and the nitrocellulose 
?«=2o filter with bound variants is transferred to a container with 
]t% citrate buffer^ pH 4.5 and incubated at 90^C for 15 min. The 
0 cellulose acetate filters with colonies are stored on the TY- 
plates ac room temperature until use. After incubation, 
residual activity is detected on assay plates containing 1% 
2S agarose, 0.2% starch in citrate buffer, pH 6.0. The BLBsa-Y 
plates with nitrocellulose filters are marked the same way as 
the filter sandwich and incubated for 2 hours, at 50*^C. After 
removal of the filters the assay plates are stained with 10% 
Lugol solution. Starch degrading variants are detected as white 
30 spots on dark blue background and then identified on the 
storage plates. Positive variants are rescreened twice under 
the same conditions as the first screen- 

Secondax-y screening 

35 Positive transf ormants after rescreening sre picked from 

the storage plate and tested in a secondary plate assay. 
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Positive tranafortnants are grown for 22 hours at 3 7°C in 5 ml 
LB + chloramphenicol. The Bacillus culture of each positive 
tranaf ormant and a control LE174 variant were incubated in 
citrate buffer, pH 4.5 at 90^*0 and samples were taken at 
5 0,10,20,30,40,60 and SO minutes, A 3 microliter sample was 
spotted on a assay plate. The assay plate was stained with 10% 
Lugol solution. Improved variants were seen as variants with 
higher residual activity detected as halos on the assay plate 
than the backbone. The improved variants are determined by 
xo nucleotide sequencing. 



Fermentation and purification of a- amylase variants 

A S, subtilis strain harbouring the relevant expression 
plasmid is streaked on a LB-agar plate with 15 )ig/ml 
^15 chloramphenicol from -S0*^C stock, ap.d grown overnight ac 37^C. 

The colonies are transferred to 100 ml BPX media supplemented 
I with 15 ^xg/ml chloramphenicol in a 500 ml shaking flask. 
Composition of BPX medium: 

Potato starch 100 g/1 

■i,2o Barley flour 50 g/1 

J BAN 5000 SKB 0.1 g/1 

Sodium caseinate 10 g/1 

Soy Bean Meal 20 g/1 

Na^HPOi, 12 H.O 9 g/1 

25 Pluronic^"^ 0 , 1 g/1 

The culture is shaken at 37=»C at 270 rpm for 5 days. 
Cells and cell debris are removed from the fermentation 
broth by centrif ugation at 4500 rpm in 20-25 minutes. Afterwards 
the supernatant is filtered to obtain a completely clear 
30 solution* The filtrate is concentrated and washed on a UF- 
filter (10000 cut off membrane) and the buffer is changed to 
20mM Acetate pH 5,5. The UF-filtrate is applied on a S-sepharose 
F.F. and elution is carried out by step elution with 0 . 2M NaCl 
m the same buffer. The eluate is dialysed against lOmM Tris, pH 
35 9.0 and applied on a Q-sepharose P.P. and eluted with a linear 
gradient from 0-0. 3N! NaCl over 6 column volumes. The fractions 
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which contain the activity (measured by the Phadebas assay) are 
pooled, pH was adjusted to pH 7.5 and remaining color was 
removed by a treatment with 0.5% W/vol , active coal in 5 
minutes . 

5 

Stability determination 

All the stability trials are made using the same set up. 
The method xb i 

The enzyme is incubated under the relevant conditions (1- 
10 4). Samples are taken at 0, 5, 10, 15 and 30 minutes and 
diluted 25 times (same dilution for all taken samples) in assay 
buffer (O.lM 50mM Britton buffer pH 7.3) and the activity is 
measured using the Phadebas assay (Pharmacia) under standard 
•=^(5 conditions pH 7.3, 37°C. 

i";is The activity measured before incubation (0 minutes) is used 

r. as reference (100%) , The decline in percent is calculated as a 
LJ function of the incubation time. The table shows the residual 
l\ activity after 3 0 minutes of incubation. 

srso Activity determination. - (KNU) 

One Kilo alpah-amylase Unit (1 KNU) is the amount of enzyme 
which breaks down 5-26 g search (Merck/ Amylum Solubile, Erg. B 
,J 6, Batch 9947275) per hour in Movo Nordisk' s standard method for 
"3 determination of alpha- amylase based upon the following 
25 condition I 

Substrate soluble starch 

Calcium content in solvent 0.0043 M 

Reaction time 7-20 minutes 

Temperature 37®C 
30 pH 5,6 

Detailed description of Novo Nordisk's analytical method (AP 9) 
is available on request . 

Specific activity detenriiriation 

3S Assay for g-Amvlase Activity 

a-amylase activity is determined by a method 
employing Phadebas® tablets as substrate. Phadebas tablets 
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(Phadebas® Amylase Test, supplied by Pharmacia Diagnostic) 
contain a cross-linked insoluble blue -coloured starch polymer 
which has been mixed with bovine serum albumin and a buffer 
substance and tabletted. 
5 For every single measurement one tablet is suspended 

in a tube containing 5 ml 50 rrO^ Br it ton- Robinson buffer (50 mM 
acetic acid, 50 mM phosphoric acid, 50 mM boric acid, 0,1 mM 
CaClj, pH adjusted to the value of interest with NaOH) - The test 
is performed in a water bath at the temperature of interest. The 
xo a-amylase to be tested is diluted in x ml of 50 mM Britton- 
Robinson buffer. 1 ml of this a-amylase solution is added to the 
5 ml 50 mM Britton-Robinson buffer. The starch is hydrolysed by 
the a-amylase giving soluble blue fragments. The absorbance of 
the resulting blue solution, measured spectrophotometrically at 
r-/i5 620 nm, is a function of the a-amylase activity. 

It is important that the measured 620 nm absorbance 
'Lr; after 10 or 15 minutes of incubation (testing time) is in the 
range of 0.2 to 2.0 absorbance units at 620 nm. In this 
absorbance range there is linearity between activity and 
j2o absorbance (Lambert-Beer law) . The dilution of the enzyme must 
m therefore be adjusced to fit thxs criterion. Under a specified 
CI set of conditions (temp,, pH, reaction time, buffer conditions) 
1 tng of a given a-amylase will hydrolyse a certain amount of 
substrate and a blue colour will be produced. The colour 
25 intensity is measured at 620 nm. The measured absorbance is 
directly proportional to the specific activity (acti^rity/mg of 
pure a-amylaae protein) of the a-amylase in question under the 
given set of conditions. 

3 0 EXAMPLES 



Example 1 - 
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Construction, bv random mutagenesis, of Termamvl-like LE174 a- 
amylase variants having an improved stability at low tdH and a 
reduced dependency on calcium ions for stability compared to the 
parent enzyme . 

5 

Random mutagenesis 

To improve the stability at low pH and low calcium 

concentration of the parent LE174 a-amylaae variant random 

mutagenesis in preselected regions was performed, 
10 The regions were: 

Region: Residue: 

SERI A42 5-y43a 

1 SERII W411-L424 

SERIII G397-G410 
^15 SERV T3 69-H3S2 

\ SERVII G310-F323 

SERIX L346-P359 
'-^ For each six region, random oligonucleotides are 

synthesized using the same mutation rate (97 % backbone and 1% 
^0 of each of the three remaixning nucleotides giving 3% mutations) 

in each nucleotide position in the above regions, e.g., 1, 

position in condon for A425: 97%C, 1%A, 1%T, 1%G, The six 
"1 random oligonucleotides and if used complementary SOS helping 

primers are shown in tablesl-6: with the four distribution of 
25 nucleotides below. 



Table 1- 

RSERI: 5'-GC GTT TTG CCG GCC GAC ATA 312 234 322 243 333 133 
444 233 423 242 212 211 243 343 CAA ACC TGA ATT-3' (SEQ ID HO: 



Table 2 , 

RSERII: 5'-GC GTT TTG CCG GCC GAC ATA CAT TCG CTT TGC CCC ACC 
GGG TCC GTC TGT TAT TAA TGC CGC 311 133 241 122 243 113 341 432 
35 423 433 223 332 242 331 GCC GAC AAT GTC ATG GTG-3' (SEQ ID NO: 
16) 



15. NOV, 1 999 1 8:07 



NOVO NOROISK PAT -^5 44493256 . 



NO. 2740 P. 11/50 



36 



Table 3. 

RSERIII: 5'-GTC GCC TTC OCT TGT CCA 433 413 112 423 124 424 423 
411 121 123 124 324 243 233 GTA CGC ATA CTG TTT TCT-3' (SEQ ID 
s NO: 17) 

Helping primer FSERIII: 5'-TGG ACA AGG GAA GGC GAG AG-3' (SEQ 
ID NOr 18) 

Table 4, 

10 RSERV; 5-TAA GAT CGG TTC AAT TTT 424 222 311 443 144 112 223 
434 324 441 423 233 222 342 CCC GTA CAT ATC CCG GTA QAA-2 (SEQ 
ID NO: 19) 

:j Helping primer FSERV: 5 -AAA ATT GAA CCG ATC TTA-3 (SEQ ID NO: 
20) 

'\ls 

Table 5, 

FSERVII: 5'-TT CCA TGC TGC ATC GAC ACA GGG AGG CGG CTA TGA TAT 
GAG GAA ATT GCT GAA 344 213 442 342 223 311 431 233 422 411 123 
/' 442 213 122 TGT CGA TAA CCA-3 ' (SEQ ID NO: 21) 

'^l Helping primer RSERVII ; TGT CGA TGC AGC ATG GAA - 3' (SEQ 

[fi ID NO: 22) 

Table 6. 

25 FSERIX: 5'-GT CCA AAC ATG GTT TAA GCC 432 243 221 343 222 212 
232 313 114 441 123 244 121 333 TCA GGT TTT CTA CGG GGA-3' (SEQ 
ID NO: 23) 

Helping primer RSERIX: S'-GGC TTA AAC CAT GTT TGG AC-3" (SEQ ID 
NO: 24) 

30 

Distribution of nucleotides in each mutated nucleotide position 
1:97%A, 1%T, 1%C, 1%G 
2:97%T, 1%A, 1%C, 1%G 
3:97%C, 1%A, 1%T, 1%G 
35 4:97%G, 1%A, 1%T, 1%C 



Construction of plasmid libraries 
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Two approximately 1«4 kb fragments were PGR amplified using 
the primer IB: 5'-CGA TTG CTG ACG CTG TTA TTT GCG-3' and the 
random oligonucleotide apparent from table 1, respectively the 
random oligonucleotide apparent from table 2. The vector 
5 pSnKlOl and the PGR fragments v;ere digested with EcoRV and EagI 
for 2 hours. The approximately 3.6 kb vector fragment and the 
approximately 1.3 kb PGR fragments was purified and ligated 
overnight and transformed in to E,coli and then further 
transformed into a Bacillus host starin as described below. The 
10 random oligonucleotides apparent from Tables 3-6 (which by a 
common term is designated aSER and bSER in Fig. 2) for each 
region and specific B, llcheniformis primers IB (SEQ ID NO: 2G) 
and #63: S'-CTA TCT TTG AAC ATA AAT TGA AAC C-3' {SEQ ID NO: 
J: 21) covering the EcoRV and the EagI sites in the LE174 sequence 
='^'-:i5 are used to generate PGR- library- fragments by the overlap 
\\ extension method (Horton et al , , Gene^ 77 (1989), pp. 61-68) 
ill Figure 2 shows the PGR strategy. The PGR fragments are cloned 
\}l in the E. coll/ Bacillus shuttle vector pSNKlOl (see Materials 
and Methods) enabling mutagenesis in E, coll and immediate 
',^^20 expression in Bacillus subtllis preventing lethal accumulation 
■P- of amylases in E. calx. After establishing the cloned PGR 
■■■■•J fragments in E. coll, a modified pUClS fragment is digested out 
^% of the plasmid and the promoter and the mutated Termamyl gene 
is physically connected and expression can take place in the 
25 Ba,cillua host . 

Screening' 

The six libraries were screened in the low pH filter 
assays described in the ''Material and Methods" section above. 

30 

All variants listed in the table in Example 2 below was 
prepared as described in Example 1. 

EXAMPLE 2 
35 Measurement of stability 

Normally, industrial liquefaction processes is run at pH 
^6,0-6-2 with addition of about 40 ppm free calcium in order to 
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impr-ove the stability at 95°C-105°C. Variants of the invention 
have been made in order to improve the stability at 

1. lower pH than pH 6,2 and/or 

2. at free calcium levels lower than 40ppm free calcium. 

5 An assay which measures the stability at acidic pH, pH 5-0, 

in the presence of 5ppm free calcium was used to measure the 
increase in stability. 

10 (j,g of the variant was incubated under the following 
conditions: A 0.1 M acetate solution, pH adjusted to pH 5-0, 
10 containing 5ppm calcium and 5% w/w common corn starch (free of 
calcium) . Incubation was made in a water bath at 95°C for 30 
minutes . 

Results : 

fr-^is Increased stability at pH 5.0, 5 ppm calcium incubated at 95'C 
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5 


65 


61 


66 


66 


10 


58 


53 


60 


59 


15 


51 


48 


55 


56 


30 


36 


39 


45 


49 



Specific activity determination. 

The specific activity was determined using the Phadebas 
assay (Pharmacia) (described above) as activity/Trig enzyme- The 
20 activity was determined using the a-amylase assay described in 
the Materials and Methods section herein. 



LE174 with the following substitutions: 
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K176R+I201F-fH205N 

Specific activity determined: 13400NtJ/mg 

LE174 with the following substitutions: 
5 Kl 7 6R+ 12 01 F+H2 0 SKf 3 7 6 K4-A4 2 0 R : 

Specific activity determined; 14770NU/mg 

IjE174 with the following substitutions: 
K176R+I2 01F+H20 5N'fS417T+A42 0Q: 
10 Specific activity determined : 1667 ONU/mg 



LE174 with the following substitutions: 
K176R+I201F+H205N^S356A+Y'35aF; 
Specific activity determined: 153 0 ONU/mg 
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CLAIMS 

1. A variant of a parent Termamyl-like a^airiylase, which variant 
a-amylase has been altered in comparison to the parent a- 

5 amylase in one or more solvent exposed amino acid residues on 
the surface of the a-amylase to increase the overall 
hydrophobicity of the a-amylase and/or to increase the overall 
nurnbers of methyl groups in the sidechains of said solvent 
exposed amino acid residues on the surface. 

10 

2. The variant according to claim 1, wherein one or more solvent 
exposed amino acxd residues on a concav surface with inwards 

^ bend are altered to more hydrophobic amino acid residues. 

-15 3- The variant according to claim 1, wherein one or more solvent 
exposed amino acid residues on a convex surface are altered to 
ij increase the number of methyl groups in the sidechain. 

7 4. A variant of a parent Termamyl-like a-amylase, comprising an 
.ao alteration at one or more positions selected from the group of: 
f} E376, 3417, A420, S356, Y358; 

wherein (a) the alteration (s) are independently 

(i) an insertion of an amino acid downstream of the amino 
acid which occupies the position, 
25 (xi) a deletion of the amino acid which occupies the 

position, or 

(iii) a substitution of the amino acid which occupies the 
position with a different amino acid, 

(b) the variant has a-amylase activity and (c) each position 
3 0 corresponds to a position of the amino acid sequence of the 
parent Termamyl-like a-amylase having the amino acid sequence of 
SEQ ID NO: 4, 



5. The variant according to claim 4y which variant has an 
35 alteration in one or more solvenn exposed amino acid residues as 
defined in any of claims 1-3. 
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5- The variant of any of claims 1-5, wherein the parent 
Termamyl-like a-amylase is der-ived from a strain of B. licheni- 
formls, B. amyloliqu&faciens, S, HCearothern?ophlius^ Bacillus 
5 sp. NCIB 12289, NCIB 12512, NCIB 12513 or DSiM 9375. 

7, The variant accordin to claim 6, wherein the parent a-amylase 
is derived from B, licheniformis strain ATCC 27811. 

10 8. The variant according to claims 1-5, whersan the parent 
Termamyl-like a-amylase is any of the a-amylases selected from 
P the group depicted in SEQ ID NOS: 1, 2, 2, 4, S, 6, 7, and 8. 

9, The variant according to any of claims 1-8, wherein the 
^^\s parent Termamyl-like a-arnylase has an amino acid sequence which 
72 has a degree of identity to SEQ ID NO: 4 of at least 65%, 
|i| preferably 70%, more preferably at least 80%, even more 
: preferably at least about 90%, even more preferably at least 
. 95%, even more preferably at least 97%, and even more preferably 
l^^o at least 97%. 

10- The variant according to any of claims 1-10, wherein the 
parent Termamyl-like a-amylase is encoded by a nucleic acid 
sequence which hydridizes under medium, preferred high 
25 stringency conditions, with the nucleic acid sequence of SEQ ID 

NO : 12 , 

11. The variant accordint to claims 1-10, wherein the parent 
Termamyl-like a-amylase is a hybrid of the B. llah&niformls a- 

30 amylase shown in SEQ ID NO: 4 and B. amyloliquef^cx^ns a-amylase 
shown in SEQ ID NO ; 5 . 

12, The variant according to claim 11, wherein the parent hybrid 
Termayl-like a-amylase is LE174. 

25 
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13. The variant according to any of claims 1-12, wherein the 
parent a-amylase further has a mutation in one or more of the 
following positions: K176, 12 01 and H2 05 (using the numbering 
in SEQ ID NO: 4) . 

5 

14, The variant according to claim 13, wherein the parent a- 
amylase has one or more the following substitutions: K176R, 
I201F and/or H205N (using the numbering in SEQ ID NO: 4) . 

10 15 1 The variant according to claim 14, wherein the parent a- 
amylase has the following substitutions: K176R+1201F+H2D5N 
f;j (using the numbering in SEQ ID NO: 4) . 

16. The variant according to claims 1 to 15, wherein the variant 
v,":,-i5 has increased stability at pHs below 7.0 (acidic pH) and/or at 
v,; low calcium concentration and/or at temperatures xn the range 

from 95 to leC'C (high temperatures) relative to the parent a- 
!H amylase . 

1^20 17. The variant according to any of claims 1 to IS, which 
variant has one or more of the following substitutions: 
E376K, S417T; A420Q,R, Y358F. 

18, A DNA construct comprising a DNA sequence encoding an a- 
25 amylase variant according to any one of claims 1 to 17, 

19. A recombinant expression vector which carries a DNA con- 
struct according to claim 18. 

3a 20, A cell which is transformed with a DNA construct according 
to claim IB or a vector according to claim 19. 

21, A cell according to claim 20, v;hich is a microorganism. 



35 22 . A cell according to claim. 21, which is a bacterium or a 
fungus . 
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23. The Cell according to claim 22, whicln is a grampositive 
bacterium such as Bacillus subtllls , Bacillus licheniformls. 
Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus , 

3 Bsicillus alka.lophilua f Bacillus amyloliquef3.ciens, Bacillus 
coag-ulans, Bacillus circulans, Bacillus lautus or Bacillus thu- 
ringiensis . 

24. A detergent additive comprising an a-amylase variant accor- 
10 ding to any one of claims 1 to 17, optionally ±xi the form of a 

non-dusting granulate, stabilised liquid or protected enzyme. 

25. A detergent additive according to claim 24 which contains 
0.02-200 mg of enzyme protem/g of the additive. 

26. A detergent additive according to claims 24 or 25, which 
additionally comprises another enzyme such as a protease, a 
lipase, a peroxidase, another amylolytic enzyme and/or a 

, cellulase. 

27. A detergent composition comprising an a-amylase variant ac- 
3 cording to any of claims 1 to 17. 

28. A detergent composition according to claim 27 which addi- 
25 tionally comprises another enzyme such as a protease, a lipase, 

a peroxidase, another amylolytic enzyme and/or a cellulase, 

29. A manual or automatic dishwashing detergent composition 
comprising an a-amylase variant according to any of claims 1 to 

30 17. 

30. A dishwashing detergent composition according to claim 29 
which additionally comprises another enzyme such as a protease, 
a lipase, a peroxidase, another amylolytic enzyme and/or a 

35 cellulase- 



31, A manual or automatic laundry washing composition comprising 
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an a-amylase variant according to any of claims 1 to 17 . 

32. A laundry washing composition according to claim 31, which 
additionally comprises another enzyme such as a protease, a 

5 lipase, a peroxidase, an amylolytic enzyme and/or a cellulase. 

33. A composition comprising: 

(i) a mixture of the a~amylase from lichenlformis having the 
sequence shown in SEQ ID NO: 4 with one or more variants 
10 according to any of claims 1 to 17 derived from (as the parent 
Termamyl-like a-amylase) the B, stearoth^rmophilus a-amylase 
p having the sequence shown in SEQ ID NO: 3; or 

't:. (ii) a mixture of the a-amylase from B, stea.z-ot±iex7nophilus 
having the sequence shown in SEQ ID NO: 3 with one or more 
1-^5 variants according to any of claims 1 to 17 derived from one or 
|.-:. more other parent Termam^-i -like a -amylases; or 

Ui (iii) a mixture of one or more variants according any of claims 
p,,, 1 to 17 derived from (as the parent Termamyl-like a-amylase) the 
^'^■^ B. stearothermophilus a-amylase having the sequence shown in SEQ 
_p ID NO; 3 with one or more variants according to the invention 
.3 derived from one or more other parent Termamyl-like a-amylases. 

34 . The composition comprising a variant of any of claims 1 to 
17 wherein the parent a-amylase is a hybrad alpha-amylase 
2S comprising a N-terminal part of the B. amylaliquefaciens a- 
amylase shown in SEQ ID NO: 5 and a C- terminal part of the B. 
lichenlformis a-amylase shown in SEQ ID NO; 4. 

35. The composition according xlo claim 34; wherein the parent 
30 hybrid Termamyl-like a-amylase is LE174 

36. The composition accojrding to claims 35, wherein the parent 
Termamyl-like a-amylase is LE174 with an alterantion in one or 
more of the following positions; K176, 12 01 and H205 . 
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37- The composition according to claims 36, wherein the parent 
Termamyl-like a-amylase is LE174 with one or more of the 
following substitutions: K176R, I201F and H2051Sr, 

38. Use of an a-amylase variant according to any one of claims 1 
to 17 or a composition according to claims 33 to 37 for washing 
and/or dishwashing . 



10 33, Use of an a-amylase variant according to any one 
to 17 or a composition according to claims 3 3 to 3 7 
Q desizing- 

^j; 40. Use of an a-amylase variant according to any of claims 1 to 
i: ^15 17 or a composition according to claims 33 to 37 for starch 
1 ique f ac t i on . 

41. A method for generating a variant of a parent Termamyl -1 ike 
[^^ a-amylase, which variant exhibits increased stability at high 
^^^0 temperatures relative to the parent, the method comprising: 
'11 (a) subjecting a DNA sequence encoding the parent Termamyl -like 

a-amylase to random mutagenesis, 

(b) expressing the mutatetd DNA sequence obtained in step (a) in 
a host cell, and 

25 (c) screening for host cells expressing a mutated a-amylase 
which has increased stability at high temperatures relative to 
the parent Termamyl -like a-amylase. 



of claims 1 
for textile 
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Title: a-amylase variants 
ABSTRACT 

The invention relates to a variant of a parent Termamyl- 

5 like 

a-amylase, comprising mutations in two, three, four^ five or 
six regions/positions- The variants have increased stability at 
high temperatures (relative to the parent) , The invention also 
relates to a DNA constrxict comprising a DNA sequence encoding an 
10 a-amylase variant of the invencion, a recombinant expression 
vector which carries a DNA construct of the invention, a cell 
which is transformed with a DNA construct of the invention, the 
i,f^ use of an a-amylase variant of the invention for washing and/or 
^ dishwashing, textile desizing, starch liquefaction, a detergent 
1%, 15 additive comprising an a-amylase variant of the invention, a 
yj manual or automatic dishwashing detergent composition comprising 
j,l an a-amylase variant of the invention, a method for generating 

a variant of a parent Termamyl-like a-amylase^ which variant 
|: ^ exhibits increased. 
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1 HHNGTNGTMM QYFEWHLPND GNHWNRLRDD ASNL.RNRGIT AIWIPPAWKG 

5 2 * .NGTNGTMM QYFEWYLPND GNHWNRLRSD ASNLKDKGIS AWIPPAWKG 

3 HHNGTNGTMM QYFEWYLPND GNHWNRLRDD AANLKSKGIT AWIPPAWKG 

• 4 ....VNGTLM QYFEWYTPND GQHWKRLQND AEHLSDIGIT AVWIPPAYKG 

5 , .ANLNGTIiM QYFEWYMPND GQHWRRLQND SAYLAEHGIT AVWIPPAYKG 

6 .AAPFNGTMM QYFEWYLPDD GTLWTKVAI^IE AbOTLSSLGIT ALWLPPAYKG 



1 TSQNDVGYGA YDLYDLGEFN QKGTVRTKYG TRSQLESAIH ALKISTNGVQVY 

2 ASQNDVGYGA YDLYDLGEFN QKGTIRTKYG TRNQLQAAVN ALKSNGXQVY 

3 TSQNDVGYGA YDLYDLGEFN QKGTVRTKYG TRNQLQAAVT SLKNNGIQVY 
/ IS 4 LSQSDNGYGP YDLYDLGEFQ QKGTVRTKYG TKSELQDAIG SLHSRNVQVY 
- 5 TSQADVGYGA YDLYDLGEFH QKGTVRTKYG TKGELQSAIK SLHSRDIlslVY 

6 TSRSDVGYGV YDLYDLGEFN QKGTVRTKYG TKAQYLQAIQ AAHAAGMQVY 

^ 101 ^ ISO 

I 20 I GDWMNHKGG ADATENVLAV EVNPNOTLiSrQE ISGDYTIEAW TKFDFPGRGN 

1 2 GDWMNHKGG ADATEMVRAV EVISTPMIRNQE VSGEYTIEAW TKFDFPGRGN 

2 GDWIVINHKGG ADGTEIVNAV EVNRSNRNQE TSGEYAIEAW TKFDFPGRGN 

4 GDWLNHKAG ADATEDVTAV EVNPANRNQE TSEEYQIKAW TDFRFPGRGN 

5 GDWINHKGG ADATEDVTAV EVDPADRNRV ISGEHIilKAW THFHFPGRGS 
25 6 ADWFDHKGG ADGTJEWVDAV EVMPSDRETQE ISGTYQIQAW TKFDFPGRGN 



1 TYSDFKWRWY HFDGVDWDQS RQFQNRIYKF RGDGKAWDWE VDSENGNYDY 

2 THSNFKWRWY HFDGVDWDQS RKLNNRIYKF RGDGKGWDWE VDTENGNYDY 
30 3 NHSSFKWRWY HFDGTDWDQS RQLQNKIYKF RGTGKAWDWE VDTENGNYDY 

4 TYSDFKWHWY HFDGADWDHS RKI,SRIPKF RGEGKAWDWE VSSENGNYDY 

5 TYSDFKWHWY HFDGTDWDES RKL.NRIYKF _ QGKAWDWE VSNENGNYDY 

6 TYSSFKWRWY HFDGVDWDES RKL.SRIYKF RGIGKAWDWE VDTENGNYDY 



10 



51 



100 



151 



200 
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201 

1 LMYADVDMDH 

2 LMYADIDMDH 

3 LMYADVDMDH 

4 LMYADVDYDH 

5 LMYADIDYDH 

6 LMYADLDMDH 



PEWisTELRRW 
PEWNELRNW 
PEVIHELKNTW 
PDWAETKKW 
PDVAASIKEIW 
PEWTSLKNW 



GEWYTNTLNL 
GVWYTNTLGL 
GWYTNTLNL 
GIWYANELSL 
GTWYANELQL 
GKWYVNTTNI 



DGFRIDAVKH 
DGFRIDAVKH 
DGFRIDAVKH 
DGFRIDAAKH 
DGFRLDAVKH 
DGFRLDAVKH 



250 
IKYSFTRDWL 
IKYSFTRDWS 
IKYSFTRDWL 
IKFSFLRDWV 
IKFSFLRDWV 
IKFSFFPDWL 



15 



20 



25 



30 



35 



251 

1 THVRNATGKE 

2 IHVRSATGKN 

3 THVRNTTGKP 

4 QAVRQATGKE 

5 NHVREKTGKE 

6 SYVRSQTGKF 

301 

1 SNSGGNYDMA 

2 'SKSGGNYDMR 

3 SNSGGYYDMR 

4 SSQGGGYDMR 

5 STQGGGYDMR 

6 SKSGGAFDMR 

351 

1 LAYALILTRE 

2 IiAYALTLTRE 

3 IxAYALVLTRE 

4 liAYAFILTRE 

5 LAYAFILTRE 
5 LAYAFILTRQ 



MFAVAEFWKN 
MFAVAEFW]<N 
MFAVAEFWKN 
MFTVAEYWQN 
MFTVAEYWQN 
LFTVGEYWSY 



KLLNGTWC3K 
QIFNGTWQR 
NILNGSWOK 
RLLDGTWSR 
KLLiNGTWSK 
TLMTNTLiMKX) 



QGYPSVFYG^D 
QGYPSVFYGD 
QGYPSVFYGD 
SGYPQVFYGD 
SGYPQVFYGD 
EGYPCVFYGD 



DLGALtENYLN 
DLGAIENYLN 
DLGAIENYLN 
NAGKLENYLKT 
DLGALENYLN 
DINKLHNYIT 



HPMHAVTFVD 
HPMHAVTFVD 
HPTHAVTFVD 
HPEKAVTFVE 
HPLKSVTFVD 
QPTLAVTFVD 



YYGIPTHS. . 
YYGIPTHG- , 
YYGIPTHG. - 
MYGTKGTSPK 
MYGTKGDSQR 
YYGIPQYN. . 



KTNWNHSVFD 
K-mWNHSVPD 
KTSWNHSAFD 
KTSFNQSVFD 
KTNFNHSVFD 
KTDGTMSLFD 



NHDSQPGESIi 
NHDSQPEEAL 
NHDSQPGEAL 
NHDTQPGQSL 
NHDTQPGQSL 
NHDTEPGQAL' 



. VPAMKAKID 
.VPAMKSKID 
.VPAMKSKID 
EIPSLKDNIE 
EIPALKHKIE 
.IPSLKSKID 



401 

1 YGTQHDYFDH 

2 YGRQN 

40 3 YGTQHDYFDH 

4 YGPQHDYIDH 

5 YGAQHDYFDH 

6 YGTQHDYLDH 



HNITGWTREG NTTHPNSGLA TIMSDGPGGE 



300 
VPLHYNLYNA 
VPLHYNFYNA 
VPLHYNLYNA 
VPLHFNLQAA 
VPLHYQPHAA 
APLHNKFYTA 

350 
ESFVQEWFKP 
ESFVEEWFKP 
ESFVQQWFKP 
ESTVQTWFKP 
BSTVQTWFKP 
QSWVDPWFKP 

400 
PILEARQNFA 
PILEARQKYA 
PLLQARQTFA 
PILKARKEYA 
PILKARKQYA 
PLLIARRDYA 

450 
KWiyiYVGQNKA 



HDIIGWTREG 
PDVIGWTREG 
HDIVGV7TREG 
SDIIGWTREG 



NSSHPNSGIjA 
DSSAA.KSGLA 
DSSVANSGLA 
GTEKPGSGLA 



TIMSDGPGGN 
ALITDGPGGS 
ALITDGPGGA 
ALITDGPGGS 



KWMYVGKNKA 
KRMYAGLKlNrA 
KRiVIYVGRQNA 
KWMYVGKQHA 



Fig. 1 (continued) 
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451 500 

5 1 GQVWHDITGN KPGTVTINAD GWANFSVNGG SVSIWVKR 

2 

3 GQVWRDITGN RTGTVTINAD GWGNFSVNGG SVSVWVKQ 

4 GETWYDITGN RSDTVKIGSD GWGEFHVNDG SVSIYVQ 

5 GETWHDITGN RSEPWINSE GWGEPHVNGG SVSIYVQR 

10 6 GKVPYDLTGN RSDTVTINSD GWGEFKVNGG SVSVWVPRKT TVSTIARPIT 

501 519 

1 

2 

15 3 

4 

5 . , 

6 TRPWTGEFVR WTEPRLVAW 



Fig. 1 (continued) 
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LEI 74 



ECORV 



EagI 



IB 



aSER 



-6- 



bSER 



< — 

#63 



i 



Mutated fragment 

e 



Fig. 2 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: NOVO NORXiISK A/S 

(B) STREET: Novo Alle 

(C) CITY: DK-2e60 Bagsvaerd 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP): DK-28S0 

(G) TELEPHONE: +45 44 44 38 88 

(H) TELEFAX: +45 44 43 32 56 
(il) TITLE OF INVENTION: a-amylage variants 

(ill) NUMBER OF SEQUENCES- 32 ^-^^^-^^niis 
(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM; PC-DOS/MS-DOS 

(2) INFORMATION FOR SEQ ID NO: 1; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

-r? (C) STRANDEDNESS : single 

:i (D) TOPOLOGY: linear 

:^2 5 (ii) MOLECULE TYPE: protein 

;l (iii) Organism: Bacillus sp, 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO - 1- 



15 



20 



^^3 0 



45 



50 



50 



65 



His His Aen Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp ivr 

Leu Pro Asxx A^p Gly Aen Bis Trp Asn Arg Leu Arg A^p Asp Ala Ala 

Asn Leu Lys S^r Lys Gly He Thr Ala Val Trp He Pro Pro Ala Trp 
35 40 45 ^ 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 

55 60 

Aap Leu Oly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 



80 

Thr Arg Asn Gin Leu Gin Ala Al^ Val Thr Ser Leu Lys Asn Asn Gly 
^5 90 95 " 

rie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 



110 



Gly Thr Glu He Val Asii Ala Val Glu Val Aen Arg Ser Asn Arg Asn 

120 125 

Gin Glu Thr ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe ASp 

Phe Pro Gly Arg Gly Aen Asn His ser Ser Phe Lys Trp Arg Trp Tyr 

155 160 

His Phe Asp Gly Thr Asp Txp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
lo5 170 

He Tyr Lys Phe Arg Gly Thx Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 195 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asc Met 
135 200 205 



15. NOV. 1 999 1 8: W 
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Asp His Pro GlxL Val rle His Glu Leu Ar^ Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
5 225 230 235 24=0 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

10 Thr Gly Lye Pro Men Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

260 265 270 



15 



4:5 



Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 260 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
230 295 300 



Gly Tyr Tyr Asp Met Arg Asn He Leu Asn Gly Ser Val Val Gin Lys 
20 305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

J 25 Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 

340 345 350 

L Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 

355 360 ^ 365 



Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Lyg Ser 
370 375 360 



Lys He Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
,',.b35 385 ' 390 395 400 



: Gin His Asp Tyr Phe Asp His His Asp He He Gly Trp Thr Arg Glu 

1-^^ 405 410 415 

"'^■l4 0 Gly Asn ser Ser His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 

•j' 420 425 430 

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
435 440 445 



Gin Val Trp Arg Aap He Thr Gly Asn Arg Thr Gly Thr Val Thr He 
450 455 460 



Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
50 465 470 475 480 

Val Trp Val Lys Gin 
4S5 

55 (2) IKFOKWATION FOR SEQ ID NO : 2: 
(i) SEQUENCE CHARACTER X STl C3 : 

(A) LENGTH; 495 airiino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 
€0 it)) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(iii) Organism; Bacillus sp . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

6S His His Asn Gly Thr Aan Gly Thr Met Met Gin Tyr Phe Glu Trp His 

15 10 15 



5. NOV. 1 999 1 8:1 7 
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Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
20 25 30 

Asn Leu Arg Asn Arg Gly Tie Thr Ala lie Trp He Pro Pro Ala Trp 
5 35 40 45 

Lys Gly Thr Ser Gin Aan Asp Val Gly Tvr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

10 Asp Leu Gly Glu Phe Asn Gin Lyg Gly Thr Val Arg Thr Lys Tyr Gly 

^5 70 75 ao 



X5 



45 



60 



Thr Arg Ser Gin Leu Glu Sqt Ala lie His Ala Leu Lys Asn Asn Gly 
B5 90 95 

Val Gin Val Tyr Gly Asp Val Val Met Asn Hxs Lys Gly Gly Ala Asp 
100 105 110 



Ala Thr Glu Aen Val Leu Ala Val Glu Val A^n Pro Asn Asn Arg Asn 
20 115 120 125 

Gin Glu He Ser Gly Asp lyr Thr He Glu Ala Trp Thr Lys Phe Asp 

:J 130 13 5 14 0 

■"'25 Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 

145 150 155 leO 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 
1^5 170 175 

-^3 O 

He Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
* ISO 135 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
,;35 135 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 

11 210 215 220 

;4o Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 

y 225 230 255 240 

He Lys Tyr Ser Phe Thr Acg Asp Trp Leu Thr His Val Arg Asn Ala 
245 250 255 



Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
2S0 2S5 270 



Gly Ala Leu Glu Aan Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
50 275 2S0 265 

Phe Asp Val Pro Leu His T^^r Asn Leu Tyr Asn Ala Ser Asn ser Gly 
290 2;?5 ' 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
305 310 315 320 

His Pro Met His Ala Val Thr Phe Val A^sp Asn His Asp Ser Gin Pro 
325 330 335 



Gly Glu Ser Leu Glu Ser Phe Val Gin Glu Trp Phe Lys Pro Leu Ala 
340 345 350 



Tyr Ala Leu He Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
65 355 360 3^5 

Gly Asp Tyr Tyr Gly He Pro Thr His Ser Val Pro Ala Met Lys Ala 



NOV, 1 999. 1 8:1 8 
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370 375 

Lys Xle Asp Pro lie Leu Glu Ala Arg Gin Asn Phe Ala Tyr Glv Tiir 

395 350 ---- ' 



335 400 



Gin His Asp Tyr Phe Asp His His Asn He lie Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
^0 420 425 430 

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gin Asn Lys Ala Gly 
435 440 445 

Gin Val Trp His Asp He Thr Gly Asn Lys Pro Gly Thr Val Thr He 

455 

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Glv Glv Ser Val Ser 
465 470 47S * ' 480 

He Tzrp Val Lys Arg 
4S5 



IS 



20 



45 



(2) INFORMATION FOR SEQ ID NO: 3: 
2S (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH; 514 amino acids 
(B) TYPE: amino acid 

CC) STRANDEDNESS : single 

CD) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: protein 

(iii) Organism: Bacillus stearothermoTDhilus . 
(Xl) SEQtTENCE DSSCRIPTION: SEQ ZD No"": 3: 

Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu 
^ 5 10 15 

Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn 
20 25 30 

40 Leu Ser Ser Leu Gly He Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys 

35 40 45 

Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp 
50 55 SO 

Leu Gly Glu Phe Asn Gin Lys Gly Ala Val Arg Thr Lys Tvr Glv Thr 
^5 70 75 ^ QQ 

Lys Ala Gin Tyr Leu Gin Ala He Gin Ala Ala His Ala Ala Gly Met 
50 85 90 95 

Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Glv Gly Ala Asp Gly 
100 105 ' 

Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin 

120 125 

Glu He Ser Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe Asp Phe 
130 135 140 

Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His 

150 155 ' 160 

Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg He Tyr 
165 170 175 

Lys Phe Arg Gly He Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu 



55 



60 
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ISO 135 150 

Asn Gly Aan Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His 
19S 200 205 

5 

Pro Glu Val Val Thr Glu Leu Lys Ser Trp Gly Lys Trp Tyr Val Aan 
210 215 220 

Thr Thr Asn He Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys 
10 225 2ao 2j5 240 

Phe Ser Phe Phe Pro Asp Trp Leu Ser Asp Val Arg Ser Gin Thr Gly 
245 250 2S5 

15 Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp He Asn Lys 

260 265 270 



20 



35 



50 



65 



Leu His Asn Tyr lie Met Lys Thr Asn Gly Thr Met Ser Leu Phe Asp 
275 2S0 2B5 

Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Thr 
290 295 200 



;:; Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Mer Lys Asp Gin Pro 

^ 25 305 310 315 320 

Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gin 
325 330 335 

3 0 Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala 

.1 340 345 350 

Phe rie Leu Thr Arg Gin Glu Gly Tvr Pro Cys Val Phe Tyr Gly Asp 
3 55 3 60 3S5 



Tyr Tyr Gly He Pro Gin Tyr Asn lie Pro Ser Leu Lys Ser Lye lie 
370 375 330 



Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His 
40 385 390 395 400 

Asp Tyr Leu Asp His Ser Aep He He Gly Trp Thr Arg Glu Gly Val 
405 410 415 

45 Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro 

420 425 430 

Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys Val 
435 * 440 445 



Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr He Asn Ser 
450 45S 460 



Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp 
S5 465 470 475 480 

Val Pro Arg Lys Thr Thr Val Ser Thr He Ala Trp Ser He Thr Thr 
485 490 495 

60 Arg Pro Trp Thr Asp Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val 

500 505 510 



Ala Trp 



(2) INFORMATION FOR SEQ ID NO ; 4: 
i±) SSQUEI^CE CHARACTERISTICS: 



15, NOV, 1 999 1 8; i9 NOVO NORD'.SK PAT DP r^5 ^^493256 ~ NO. 2740 ?, 32/50 
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(A) LENGTH: 483 amino acids 

(B) TYPS: amxno acid 

(C) STRANDEDKESS ; single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: procexn 

( i i i ) Organi sm : Bacillus lichenif ormis 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

Ala Asn LavL Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro 
10 a 5 10 15 

Asn Asp Gly Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu 
20 25 30 

15 Ala Glu His Gly lie Thr Ala Val Trp He Pro Pro Ala Tyr Lys Gly 

35 40 45 



20 



35 



6S 



Thr Ser Gin Ala A9p Val Gly Tyr Gly Ala Tyr Asp Leu Tvr Asp Leu 

50 55 60 

Gly Glu Pile His Gin Lys Gly Thr val Arg Thr Lys Tyr Gly Thr Lys 

65 70 75 80 



^^^^ Gly Glu Leu Gin Ser Ala He Lys Ser Leu His Ser Arg Asp He Asn 

25 as 90 95 

f ! val Tyr Gly Asp Val Val He Asn His Lye Gly Gly Ala Asp Ala Thr 

100 105 110 

i2. 3 0 Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val 

n 115 12 0 12 5 



He Ser Gly Glu His Leu He Lys Ala Trp Thr His Phe His Phe Pro 

130 135 140 

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 

145 150 155 1€0 



"^^^ Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Axg He Tyr Lys 

ibO40 1$5 170 175 

Phe Gin Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn 
ISO " 185 190 

45 Tyr Asp Tyr Leu Met Tyr Ala Asp He Asp Tyr Asp His Pro Asp Val 

155 200 205 

Ala Ala Glu He Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin 
210 215 220 

SO 

Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His He Lys Phe Ser Phe 
225 230 235 240 

Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met 
55 245 250 255 

Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp Leu Gly Ala Leu Glu Asn 
2S0 265 270 

60 Tyr Leu Asn Lya Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu 

275 280 285 

His Tyr Gin Phe His Ala Ala Ser Thr Gin Gly Gly Gly Tyr Asp Met 
230 255 300 



Arg Lys Leu Leu Aen Gly Thr Val Val Ser Lys His Pro Leu Lys Ser 
305 310 315 320 



15, NOV. 1 999 IS; 19 NOVO NOROISK PAT DP 4^^93256 . NO. 2740 P. 33/50 



Val Thr Phe Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 
325 330 335 

5 Ser Thr Val Qln Thr Trp Ph.e Lys Pro Leu Ala Tyr Ala Phe He Leu 

340 345 350 

Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly 
355 360 365 

10 

Thr Lys Gly Asp Ser Gin Arg Glu lie Pro Ala Leu Lys His Lys He 
370 375 390 

Glu Pro He Leu Lys Ala Arg Lys Gin Tyr Ala Tyr Gly Ala Gin His 
15 335 ' 350 * 395 400 

Asp Tyr Phe Asp His His Asp He val Gly Trp Thr Arg Glu Gly Asp 
405 410 415 

2 0 Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro 

420 425 430 



25 



Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gin Asn Ala Gly Glu Thr 
435 440 445 

Trp His Asp He Thr Gly Asn Arg Ser Glu Pro Val Val He Asn Ser 
450 455 460 



i Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser He Tyr 

^30 465 470 475 4S0 

Val Gin Arg 

"35 (2) XUFORMATION FOP. SSQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 
:^ (A) LRtTGTH: 49 0 amino acids 

(B) TYPE: amino acid 
' (C) STRANDSDNESS : Single 

Ijio (D) TOPOLOGY; linear 

-i (ii) MOLECULE TYPE: protein 

'"^ (iii) Organis-m: Bacillus amyloliquef aciene 

(Xi) SEQUENCE DESCRIPTIOK: SEQ ID NO; 5: 

45 Val Aen Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro Asn Asp 

15 10 IS 

Gly Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu Ser Asp 
20 25 30 

He Gly He Thr Ala Val Trp He Pro Pro Ala Tyr Lys Gly Leu Ser 
35 40 45 



50 



Gin Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu 
55 50 5i5 60 

Phe Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu 
€S 70 75 80 

60 Leu Gin Asp Ala He Gly Ser Leu His Ser Arg Asn Val Gin Val Tyr 

85 90 95 

Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Agp 
100 105 110 



ss 



Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu Thr Ser 
115 120 125 



NOV. 1999 18:20 
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Glu Glu Tyr Gin He Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg 
130 135 140 

Gly Ash Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly 
14S 150 * 155 160 

Ala Asp Trp Asp Glu Ser Arg Lys He Ser Arg He Phe Lys Phe Arg 
165 170 ^ 175 

Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 
180 135 190 

Tyr Asp Tyr Leu Met Tyr Ala Aep Val Asp Tyr Asp His Pro Asp Val 
15 195 200 205 

Val Ala Glu Thr Lys Lys Trp Gly He Trp Tyr Ala Asn Glu Leu Ser 
210 215 220 

20 Leu Asp Gly Phe Arg He Asp Ala Ala Lys His He Lys Phe Ser Phe 

225 230 235 240 



25 



55 



Leu Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys Glu Met 
245 250 255 

Phe Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu Glu Asn 
260 265 * 270 



Tyr Leu Asn Lys Thr Ser l?he Asn Gin Ser Val Phe Asp Val Pro Leu 

ho 275 280 285 

I His Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr Asp Met 

^ 230 295 ' 300 

,35 Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Giu Lys Ala 

305 310 315 320 

1 Val Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 

^ 325 320 335 



Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu 
340 345 350 



Thr Arg Giu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met: Tyr Gly 
45 355 260 ^ 365 

Thr Lys Gly Thr Ser Pro Lys Glu He Pro Ser Leu Lys Asp Asn He 
370 375 380 

5 0 Glu Pro He Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gin Kis 

335 390 395 400 



Asp Tyr He Asp His Pro Asp Val He Gly Trp Thr Arg Glu Gly Asp 
405 410 415 

Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro 
420 425 430 



Gly Gly Ser Lys Arg Met Tr/r Ala Gly Leu Lys Asn Ala Gly Glu Thr 
so 435 440 445 

Trp Tyr Asp He Thr Gly Asn Arg Ser A.sp Thr Val Lys lie Gly Ser 
450 455 4S0 

65 Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser He Tyr 

465 470 475 480 



NOV 1 999 1 8:2 1 NOVO NORDISK PAT DP +^5 ^M93256 
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(2) INFORMATION FOR SEQ ID NO : 6: 
(i) SSQUSNCE CHAI^CTERISTICS : 
S (A) LENGTH: 4 65 amino acids 

(B) TYPE; amino acid 

(C) STRANDHDNESS I sinsie 

(D) TOP0I.OGy; linear 
(ii) MOLECULE TYPE: protein 

10 (iii) Organism: Bacillus sp. 

(xi) SSQUEUCE DESCRIPTION: SEQ ID NO : 6: 



IS 



His His Asn Gly Thr Ann Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

Leu Pro Asn Aep Gly Asn His Trp Asn Arg Leu Asn Ser Asp Ala Ser 
20 25 30 

20 Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp He Pro Pro Ala Trp 

35 40 45 

Lys Gly Ala Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 €0 



25 



40 



55 



Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 eo 



Thr Arg Ser Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
30 B5 50 95 

He Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

35 Ala Thr Glu Met Val Arq Ala Val Glu Val Asn Pro Asn Asn Arg Aen 

115 120 125 

Gin Glu Val Thr Gly Glu Tyr Thr He Glu Ala Trp Thr Arg Phe Asp 
130 135 140 



Phe Pro Gly Arg Gly Asn Thr His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 



His Phe Aep Gly Val Asp Trp Asp Gin Ser Arg Arg Leu Asn Asn Arg 
45 \€S 170 175 

He Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp 
ISO 165 150 

SO Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr AJLa Asp He Asp Met 

155 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Aen Trp Gly Val Trp Tyr 
210 215 220 



Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 230 235 240 



He Lys Tyr Ser Phe Thr Arg Asp Trp He Asn His Val Arg Ser Ala 
60 245 250 255 

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

BS Gly Ala He Glu Asn Tyr Leu Gin Lys Thr Asn Trp Asn His Ser Val 

275 280 285 



5. NOV. 1 9 99 1 8:2 1 NOVO MORDiS< PAT DP +45 ^^493256 MO. 2740 P. 36/5 
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Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly 
230 29B 300 

Gly Asn Tyr Asp Met Arg Asn lie Phe Asa Gly Thr Val Val Gin Arg 
5 305 310 315 320 

His Pro Ser His Ala Val Thr Phe Val Aep Asn His Asp Ser Gin Pro 
325 330 335 

10 Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala 

340 345 350 



IS 



Tyr Ala Leu Thr Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 

355 360 355 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Arg Ser 

370 375 390 



Lys He Asp Pro He Leu Glu Ala Arg Gin Lys Tyr Ala Tyr Gly Lys 

20 385 3.90 395 400 

Gin Aen Asp Tyr Leu Aisp His His Asn He He Gly Trp Thr Arg Glu 

405 410 415 

25 Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 

420 425 430 



30 



Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lye Ala Gly 
435 440 445 

Gin Val Trp Ser Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 
450 455 460 



ASn Ala Asp Gly Trp Gly ABn Phe Ser Val Asn Gly Gly Ser Val Ser 
35 4SS 470 475 490 

He Trp Val Asn Lys 
485 

40 (2) IITFORMATION" FOR SEQ ZD KO : 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4S5 amino acids 

(B) TYPE; amino acid 

(C) STKANDEDKESS : single 
45 iv) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) Organism: Bacillus sp . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 

50 His His Asn Gly Thr Abii Gly Thr Met Met Gin Tyr Phe Glu Trp Tvr 

15 10 15 



55 



Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly He Thr Ala Val Trp He Pro Pro Ala Trp 
35 40 45 



Lys Gly Thr Ser Gin Asi: Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
GO 50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

65 Thr Arg Aen Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 

B5 90 95 



5. NOV. 1 999 1 3:22 ^OVO NORDISK PAT D? -45 ^m2'A ^0.2740 P. 3//5( 
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lie Gin Val Tyr Gly hsp Val Val Met Asn His Lys Gly Gly Ala Asp 

100 105 110 

Gly Thr Glu He Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
5 115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

10 Phe Pro Gly Arg Gly Aen Asn His Ser Ser Phe Lys Trp Arg Trp Tvx 

145 150 155 ' 160 



15 



3Q 



4S 



60 



His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin X^eu Gin Asn I^ys 
165 170 175 

He Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 

160 185 190 



Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Mec Tyr Ala Asp Val Asp Met 
20 195 200 a05 

Asp His Pro Glu Val He His Olu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

2S Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg He Asp Ala Val Lys His 

225 2:30 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 



Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
2S0 265 270 



Gly Ala He Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
35 275 260 285 

Phe Asp Val Pro Leu His xyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

4 0 Gly Tyr Tyr Asp Met Arg Asn He Leu Asn Gly Ser Val Val Gin Lys 

305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 



Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 



Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
50 355 3^0 365 

Gly Asp Tyr Tyr Gly He Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 360 

55 Lys He Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Glv Thr 

3S5 350 395 ' 400 



Gin His Aep Tyr Phe Asp Kis His Asp He He Gly Trp Thr Arg Glu 
405 410 415 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 
420 425 430 



Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
SS 435 440 ' 445 

Gin Val Trp Arg Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 



15. NOV. 1 999 1 8:22 
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450 455 



460 



Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
, ^^O 475 430 

Val Trp Val Lys Gin 
485 

(2) INFORMATION FOR SEQ ID NO; 8: 
10 (i) SEQUENCE CIJARACTERISTICS : 

(A) LENGTH: 4 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS r sinale 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: protein 

(iii) Organism: Bacillus sp . 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO : B: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe GXu Trp His 
20 1 B 10 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
20 25 30 

25 Asn Leu Arg Asn Arg Gly He Thr Ala He Trp He Pro Pro Ala Trp 

35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tvr Asp Leu Tyr 

30 " 60 - ^ 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tvr Glv 

70 75 ' SO 

Thr Arg Ser Gin Leu Glu Ser Ala He Hie Ala Leu Lys Asn Asn Glv 
3^ 85 90 95 

val Gin Val Tyr Gly ABp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

40 Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 

115 120 125 

Gin Glu He Ser Gly Asp Tyr Thr He Glu Ala Trp Thr Lys Phe Asp 

135 

45 

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 

150 155 

Hxs Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 

50 165 170 

He Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 1S5 

5 5 Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 

200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tvr 
210 215 220 

Thr Asn Thr Leu Aen Leu Asp Gly Phe Arg He Asp Ala Val Lys His 
225 2.30 235 240 

lie Lye Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 
245 250 255 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 



£0 
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260 265 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
275 2S0 2S5 

5 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
10 305 310 315 320 

His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

15 Gly Glu Ser Leu Glu Ser Phe val Gin Glu Trp Phe Lys Pro Leu Ala 

340 345 350 



20 



Tyr Ala Leu He Leu 'Ihr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 3^0 365 

Gly Asp Tyr Tyr Gly :Cle Pro Thr His Ser Val Pro Ala Met Lvs Ala 
370 375 380 



Lys He Asp Pro Xle Leu Glu Ala Arg Gin Asn Phe Ala Tyr Gly Thr 
25 385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asn He He Gly Trp Thr Arg Glu 
405 410 415 

3 0 Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 

420 425 430 



35 



Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gin Asn Lys Ala Gly 
435 440 445 

Gin Val Trp His Asp He Thr Gly Aen Lys Pro Gly Thr Val Thr Xle 
450 455 460 



Aen Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
40 455 470 475 480 

He Trp Val Lys Arg 
485 

45 (2) INFORMATION FOR SEQ ID KO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANLEPKESS : single 
5 0 (D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA (senomic) 

(iii) Organism: Bacillus sp . 

(xi) SEQUENCE DESCRIPTION: SEO ID KO: 9: 



5S 


CATCATAATG 


GAACAAATGG 


TACTATGATG 


CAATATTTCG 


AATGGTATTT 


GCCAAATGAC 


60 




GGGAATCATT 


GGAACAGGTT 


GAGG(3ATGAC 


GCAGCTAACT 


TAAAGAGTAA 


AGGGATAACA 


120 


60 


GCTGTATGGA 


TCCCACCTGC 


ATGGAAGGGG 


ACTTCCCAGA 


ATGATGTAGG 


TTATGGAGCC 


IBO 


TATGATTTAT 


ATGATCTTGG 


AGAGTTTAAC 


CAGAAGGGGA 


CGGTTCGTAC 


AAAATATGGA 


240 




ACACGCAACC 


AGCTACAGGC 


TGCGGTGACC 


TCTTTAAAAA 


ATAACGGCAT 


TCAGGTATAT 


300 


65 


GGTGATGTCG 


TCATGAATCA 


TAAAGGTGGA 


GCAOATGGTA 


CGGAAATTGT 


AAATGCGGTA 


360 




GAAGTGAATC 


GGAGCAACCG 


AAACCAGGAA 


ACCTCAGGAG 


AGTATGCAAT 


AGAAGCGTGG 


420 
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ACMAGTTTG ATTTTCCTGG AAGAGGAAAT AACC4TTCCA GCTTTAAGTG GCGCTGGTAT 4 60 

CATTTTGATG GGACAGATTG GGJi.TCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 

AGGGGAACAG GCAAGGCCTG GGA.CTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT SOO 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG &€0 

10 GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGA.TGC AGTGAAACAT 72 0 

ATAAAATTATA GCTTTACGAG AGAXTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 73 0 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 840 

15 

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA SOO 

TCTAATAGCG GTGGTTATTA TGATATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

20 CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 102 0 

GAATCCTTTG TTCAACAATG GTr:':AAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTA.TGGGGAT TACTACGGTA TCCCAACCCA TGGXGTTCCG 114 0 

25 

GCTATGAAAT CTAAAATAGA CCCT'CTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 12 00 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 60 

3 0 CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGA.TA TTACCGGAAA TAGGACAGGC 13 8 0 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 144 0 

GXTTGGGTGA AGCAA 0.455 



3S 



(2) INPORMAmON FOR SEQ ID UO : 10 i 
(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
45 (iii) Organism: Bacillus sp, 

(Xi) SEQUENCE DESCRIPTION: SEQ ID mi 10: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 6 0 

50 GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 

GCTATTTGGA TTCCGCCTGC CTGG^iAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 18 0 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 

55 

ACACGTAGTC AATTGGAGTC TGCCJlTCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 3 00 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 3 60 

60 GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 42 0 

ACTAAGTTTG ATTTTCCAGG GAGGG^GTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 4 SO 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600 
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TTAATGTATG 


CAGATGTACA 


TATGGATCAT 


CCGGAGGTAG 


TAAATGAGCT 


TAGAAGATGG 


660 


GGAGAXTGGT 


ATACAAATAC 


ATTAAA.TCTT 


GATGGATTTA 


GGATCGATGC 


GGTGAAGCAT 


720 


ATTAAATATA 


GCTTTACACG 


TGATTGGTTG 


ACCCATGTAA 


GAAACGCAAC 


GGGA^AAGAA 


7S0 


ATGTTTGCTG 


TTGCTGAATT 


TTGGAAAAAT 


GATTTAGGTG 


CCTTGGAGAA 


CTATTTAAAT 


840 


AAAACAAACT 


GGAATCATTC 


TGTCTTTGAT 


GXCCCCCTTC 


ATTATAATCT 


TTATAACGCG 


3O0 


TCAAATAGTG 


GAGGCAACTA 


TGAi^ATGGCA 


AAACTTCTTA 


ATGGAACGGT 


TGTTCAAAAG 


960 


CATCCAATGC 


ATGCCGTAAC 


TTTTGTGGAT 


AATCACGATT 


CTCAACCTGG 


GGAATCATTA 


1020 


GAATCATTTG 


TACAAGAATG 


GTTTAAGCCA 


CTTGCTTATG 


CGCTTATTTT 


AACAAGAGAA 


lOSO 


CAAGGCTATC 


CCTCTGTCTT 


CTATGGTGAC 


TACTATGGAA 


TTCCAACACA 


TAGTGTCCCA 


1140 


GCAATGAAAG 


CCAAGATTGA 


TCCi^TCTTA 


GAGGCGCGTC 


AAAATTTTGC 


ATATGGAACA 


1200 


CAACATGATT 


ATTTTGACCA 


TCATAATATA 


ATCGGATGGA 


CACGTGAAGG 


AA?VTACCACG 


1260 


CATCCCAATT 


CAGGACTTGC 


GACTATCATG 


TCGGATGGGC 


CAGGGGGAGA 


GAAATGGATG 


1320 


TACGTAGGGC 


AAAfe^TAAAGC 


AGGTCAAGTT 


TGGCATGACA 


TA?^CTGGAAA 


TAAACCAGGA 


1380 


ACAGTTACGA 


TCAATGCAGA 


TGGATGGGCT 


AATTTTTCAG 


TAAATGGAGG 


ATCTGTTTCC 


1440 


ATTTGGGTGA 


AACGA 










1455 



30 

(2) INFORMATION FOR SEQ ID NO : 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DHA (genomic) 
fiii) Orga-niBiriT Bacillus gtearothermophilus 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: lit 



GCCGCACCGT 


TTAACGGCAC 


CATGATGCAG 


TATTTTGAAT 


GGTACTTGCC 


GGATGATGGC 


60 


ACGTTATGGA 


CCAAAGTGGC 


CAAI'GAAGCC 


AACAACTTAT 


CCAGCCTTGG 


CATCACCGCT 


120 


CTTTGGCTGC 


CGCCCGCTTA 


CAMGGAACA 


AGCCGCAGCG 


ACGTAGGGTA 


CGGAGTATAC 


180 


GACTTGTATG 


ACCTCGGCGA 


ATTCAATCAA 


AAAGGGACCG 


TCCGCACAAA 


ATACGGAACA 


240 


AAAGCTCA^T 


ATCTTCAAGC 


CATTCAAGCC 


GCCCACGCCG 


CTGGAATGCA 


AGTGTACGCC 


300 


GATGTCGTGT 


TCGACCATAA 


AGGCGGCGCT 


GACGGCACGG 


AATGGGTGGA 


CGCCGTCa^A 


3S0 


GTCAATCCGT 


CCGACCGCAA 


CCAAGAAATC 


TCGGGCACCT 


ATCAAFiTCCA 


AGCATGGACG 


420 


AAATTTGATT 


TTCCCGGGCG 


GGGCA^CACC 


TACTCCAGCT 


TTAAGTGGCG 


CTGGTACCAT 


460 


TTTGACGGCG 


TTGATTGGGA 


CGAAAGCCGA 


AAATTGAGCC 


GCATTTACAA 


ATTCCGCGGC 


540 


ATCGGCAAAG 


CGTGGGATTG 


GGAAGTAGAC 


ACGGAAAACG 


GAAACTATGA 


CTACTTAATG 


600 


TATGCCGACC 


TTGATATGGA 


TCATCCCGAA 


GTCGTGACCG 


AGCTGAAAAA 


CTGGGGGAAA 


€€0 


TGGTATGTCA 


ACACAACGAA 


CATTGATGGG 


TTCCGGCTTG 


ATGCCGTCAA 


GCATATTAAG 


720 


TTCAGTTTTT 


TTCCTGATTG 


GTTGTCGTAT 


GTGCGTTCTC 


AGACTGGCAA 


GCCGCTATTT 


780 
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ACCGTCGGGG 


AATATTGGAG 


CTATGACATC 


AACAAGTTGC 


ACAATTACAT 


TACGAAA?iCA 


840 




GACGGAACGA 


TGTCTTTGTT 


TGATGCCCCG 




AATTTTATAC 




300 




TCAGGGGGCG 


catttgatat 


GCGCACGTTA 


ATGACCAATA 


CTCTCATGAA 


AGATCAACCG 


960 




ACATTGGCCG 


T CAC CTT CGT 


TGATAATCAT 


GACAC CGAAC 


ccggccaagc 


GCTGCAGTCA 


1020 


10 


TGGGTCGACC 


CATGGTTCAA 


ACCGTTGGCT 


TACGCCTTTA 


TTCTAACTCG 


GCAGGAAGGA 


1090 


TACCCGTGCG 


TCTTTTATGG 


TGACTATTAT 


GGCATTCCAC 


AATATAACAT 


TCCTTCGCTG 


1140 




AAAAGCAAAA 


TCGATCCGCT 


CCTCATCGCG 


CGCAGGGATT 


ATGCTTACGG 


AACGCAACAT 


1200 


15 


GATTATCTTG 


ATCACTCCGA 


CATGATCGGG 


TGGACAAGGG 


AA.GGGGGCAC 


TGAAAAACCA 


1260 




GGATCCGGAC 


TGGCCGCACT 


GATCJACCGAT 


GGGCCGGGAG 


GAAGCAA^TG 


GATGTACGTT 


1320 


20 


GGCAAACAAC 


ACGCTGGAAA 


AGTGTTCTAT 


GACCTTACCG 


GCAACCGGAG 


TGACACCGTC 


1360 


ACCATCAACA 


GTGATGGATG 


GGGGGAATTC 


AAAGTCAATG 


GCGGTTCGGT 


TTCGGTTTGG 


1440 




GTTCCTAGAA 


AAACGACCGT 


TTCTACCATC 


GCTCGGCCGA 


TCACAACCCG 


ACCGTGGACT 


1500 


25 


GGTGAATTCG 


TCCGTTGGAC 


CGAACCACGG 


XTGGTGGCAT 


GGCCTTGA 




154S 



(2) INFORMATION FOR SEQ ID NO: 12; 
(i) SSQTJENCE CHARACTERISTICS: 
30 (A) LEr^GTH: 192 0 base pairs 

{E) TYPE; nucleic acid 

(C) STRANDEPNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (iii) Organism: Bacillus lichenif ormis 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(5) LOCATION: 421. .1872 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

40 

CGGAAGATTG GAAGTACAAA AATA?^GGAAA AGATTGTCA?^ TCATGTCATG AGCCATGCGG 60 
GAGACGGAAA AATCGTCTTA ATGCACGATA TTTATGCA^VC GTTCGCAGAT GCTGCTGAAG 12 0 

4 5 AGATTATTAA AAAGCTGAA?\ GCAAZ^AGGCT AtCAATTGGT AACTGTATCT CAGCTTGAAG 180 
AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 24 0 

TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAZL?LA TTCGGAATAT TTATACAACA 3 00 

50 

TCATATGTTT CACATTGAAA GGGGAGGAGA ATCATGAAAC AACAAAAACG GCTTTACGCC 36 0 

CGATTGCTGA CGCTGTTATT TGCGCTCATC TTCTTGCTGC CTCATTCTGC AGCAGCGGCG 42 0 

53 GCA AAT CTT AAT GGG ACG CTG ATO CAG TAT TTT GAA TGG TAC ATG CCC 468 
AAT GAG GGC CAA CAT TGG AGG CGT TTG CAA AAC GAC TCG GCA TAT TTG 51^ 
GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT CCC CCG GCA TAT AAG GGA 5^4 

SO 

ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT TAC GAC CTT TAT GAT TTA 612 
GGG GAG TTT CAT CAA AAA CGC5 ACG GTT CGG ACA A^^G TAC GGC ACA AAA 660 
GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT CAT TCC CGC GAC ATT AAC 70fi 
GTT TAC GGG GAT GTG GTC ATC! AAC CAC AAA GGC GGC GCT GAT GCG ACC 756 
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5 



15 



25 



35 



45 





GAT 


GTA 


ACC 


GCG 


GTT 


GAA 


GTC 


GAT 


CCC 


GCT 


GAC 


CGC 


AAC 


CGC 


GTA 


804 


ATT 


TCA 


GGA 


GAA 


CAC 


CTA 


ATT 


AAA 


GCC 


TGG 


ACA 


CAT 


TTT 


CAT 


TTT 


CCG 


eS2 


GQG 


CGC 


GGC 


AGC 


ACA 


TAC 


AGC 


GAT 


TTT 


AAA 


TGG 


CAT 


TGG 


TAC 


CAT 


TTT 


900 


GAC 


GGA 


ACC 


GAT 


TGG 


GAC 


GAG 


TCC 


CGA 


AAG 


CTG 


AAC 


CGC 


ATC 


TAT 


AAG 


94B 


TTT 


CAA 


GGA 


AAG 


GCT 


TGG 


GAT 


TGG 


GAA 


GTT 


TCC 


AAT 


GAA 


AAC 


GGC 


AAC 


996 


TAT 


GAT 


TAT 


TTG 


ATG 


TAT 


GCC 


GAC 


ATC 


GAT 


TAT 


GAC 


CAT 


CCT 


GAT 


GTC 


1044 


GCA 


GCA 


GAA 


ATT 


AAG 


AGA 


TGG 


GGC 


ACT 


TGG 


TAT 


GCC 


AAT 


GAA 


CTG 


CAA 


1092 


TTG 


GAC 


GGT 


TTC 


CGT 


CTT 


OAT 


GCT 


GTC 


AAA 


CAC 


ATT 


AAA 


TTT 


TCT 


TTT 


1140 


TTG 


CGG 


GAT 


TGG 


GTT 


AAT 


a^T 


GTC AGG 'GAA AAA 


ACG 


GGG 


AAG 


GAA 


ATG 


118B 


TTT 


ACG 


GTA 


GCT 


GAA 


TAT 


TGG 


CAG 


AAT 


GAC 


TTG 


GGC 


GCG 


CTG 


GAA 


AAC 


1236 


TAT 


TTG 


AAC 


AAA 


ACA 


AAT 


TTT 


AAT 


CAT 


TCA 


GTG 


TTT 


GAC 


GTG 


CCG 


CTT 


1264 


CAT 


TAT 


CAG 


TTC 


CAT 


GCT 


GCA 


TCG 


ACA 


CAG 


GGA 


GGC 


GGC 


TAT 


GAT 


ATG 


1332 


AGG 


AAA 


TTG 


CTG 


AAC 


GGT 


ACG 


GTC 


GTT 


TCC 


AAG 


CAT 


CCG 


TTG 


AAA 


TCG 


1380 


GTT 


ACA TTT 


GTC 


GAT 


AAC 


CAT 


GAT 


ACA 


CAG 


CCG 


GGG 


CAA 


TCG 


CTT 


GAG 


1428 


TCG 


ACT 


GTC 


CAA 


ACA 


TGG 


TTT 


AAG 


CCG 


CTT 


GCT 


TAC 


GCT 


TTT 


ATT 


CTC 


1476 


ACA 


AGG 


GAA 


TCT 


GGA 


TAC 


CCT 


CAG 


GTT 


TTC 


TAC 


GGG 


GAT 


ATG 


TAC 


GGG 


1524 


ACG 


AAA 


GGA 


GAC 


TCC 


CAG 


CGC 


GAA 


ATT 


CGT 


GCC 


TTG 


AAA 


CAC 


AAA 


ATT 


1572 


GAA 


CCG 


ATC 


TTA 


AAA 


GCG 


AGA 


AAA 


CAG 


TAT 


GCG 


TAC 


GGA 


OCA 


CAG 


CAT 


1620 


GAT 


TAT 


TTC 


GAC 


CAC 


CAT 


GAC 


ATT 


GTC 


GGC 


TGG 


ACA 


AGG 


GAA 


GGC 


GAC 


1668 


AGC 


TCG 


GTT 


GCA 


AAT 


TCA 


GGT 


TTG 


GCG 


GCA 


TTA 


ATA 


ACA 


GAC 


GGA 


CCC 


1716 


GGT 


GGG 


GCA 


AAG 


CGA 


ATG 


TAT 


GTC 


GGC 


CGG 


CAA 


AAC 


GCC 


GGT 


GAG 


ACA 


1764 


TGG 


CAT 


GAC 


ATT 


ACC 


GGA 


AAC 


CGT 


TCG 


GAG 


CCG 


GTT 


GTC 


ATC 


AAT 


TCG 


1S12 


GAA 


GGC 


TGG 


GGA 


GAG 


TTT 


CAC 


GTA 


AAC 


GGC 


GGG 


TCG 


GTT 


TCA 


ATT 


TAT 


1B60 



GTT CAA AGA TAG AAGAGCAGAG AGGACGGATT TCCTGAAGGA AATCCGTTTT 1912 
5 0 TTTATTTT 1S2 0 



(2) INFORMATION FOR SEQ ID HO: 13; 
55 (i) SEQUKNCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE! nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 Cii) MOLECUI^E TYPE: DNA {genomic) 

Ciii) Organism: Bacilluis sp. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 t 
CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

65 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 12 0 
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GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC ISO 

TATGATTTAT ATGATCTTGG AGA(5TTTAAC GAGAAGGGGA CGGTTCGTAC AAAATATGGA 240 

5 ACACGCAACC AGCTACAGGC TGa?GTGACC TCTTTAAAAA ATAACGGCAT XCAGGTATAT 3 00 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360 

GAAGTGAATC GGAGCAACCG AAAGCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 42 0 

10 

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 48 0 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 

15 AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA CTATGACTAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAJ!^ TACATGAACT TAGAAACTCG 660 

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AGTGAAACAT 72 0 

20 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 790 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 84 0 

25 AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 5 00 

TCTAATAGCG GTGGTTATTA TGAIATGAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 36 0 

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 102 0 

3 0 

GAATCCTTTG TTCAACAATG GTTXAAACCA CTTGCATATG CATTGGTXCT GACAAGGGAA 108 0 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

3 5 GGTATGAAAT CTAAAATAGA CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGrACG 12 0 0 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 6 0 

CATCCAAATT CAGGCCTTGC CACCAXTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 132 0 

40 

TATGTGGGGA AAAATAAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 13 8 0 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 144 0 

4 5 GTTTGGGTGA AGCAA ^^^^ 

(2) INFORMATION FOR SEQ ID NO; 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 1455 base pairs 
SO (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) Organism; Bacillus sp . 

55 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 6 0 

GGGAATCACT GGAATAGATT AAGAGATGAT GCTAGTAATC TAAGAAATAG AGGTATAACC 12 0 

SO 

GCTATTTGGA TTCCGCCTGC CTGGi^AAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 18 0 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 

65 ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 3 00 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 3^0 
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GAGGTGA?VTC CAAATAACCG GAATCAAGAA ATATCTGGGG ACTACACAAT TGAGGCTTGG 42 0 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GGGTTGGTAT 4 80 

5 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

CGAGGTGATG GTAAGGCATG GGA':[TGGGAA GTAGATTCGG AAAATGGAAA TTATGATTAT 600 
10 TTAATGTATG CAGATGTAGA TATC5GATCAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 66 0 

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 72 0 

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 7 60 

15 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 840 
AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 90 0 

2 0 TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 96 0 

CATCCAATGC ATGCCGTAAC XTTl'GTGGAT AATCACGATT CTCAACCTGG GGAATCATTA 102 0 
GAATCATTTG TACAAGAATG GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 10 8 0 

25 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 
GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGQCGCGTC AAAATTTTGC ATATGGAACA 1200 

3 0 CAACATGATT ATTTTGACCA TCATAATATA ATCGGATGGA CACGTGAAGG AAATACCACG 126 0 

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATOGATG 1320 
TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 13 8 0 

35 

ACAGTTACGA TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 14 4 0 
ATTTGGGTGA AACGA 1455 

40 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQTJEiTCE CHARACTmiSTlCS : 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic aCid 

4 5 (C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) KAME/ KEY: mi Bc - feature : 
50 (B) OTHER INFORMATION: /desc = "RSERI" 

(ix) FEATURE: 

(A) NAME/KEY; misc-f esLture 
(B) LOCAXXON: 21-62 

(n) : OTHER INFORMATION: /Not:e=. 1,-97%A, 1%T, 1%C, 1%G 
55 2:97%T, 1%A, 1%C, 1%G 

3:97%C, 1%A, 1%T, 1%G 
4:97%G, 1%A, 1%T, 1%C 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGTTTTGCC GGCCGACATA 3122343222 4333313344 
60 4233423242 2122112433 43CAA?kCCTG AATT 74 



(2) INFORMATION FOR SEQ ID NO t 16: 
(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 122 base pairs 

(B) TYPE: nucleic; acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY I misc-f eature : 
5 (B) OTHER INFORMATION: /desc - "I^SERII" 

(ix) FEATURE: 

(A) ITAME/KEY: misc-feature 
(E) LOCATION: ^3-104 

(B) : OTHER INFORMATION: /Rote= 1:97%A, X%T, 1%C, 1%G 
10 2:97%T, I%A. 1%C, 1%G 

3r97%C. 1%A, 1%T, 1%G 
4:97%G; 1%A, 1%T, 1%C 

(Xi) SEQUENCE DESCRIPTION: SEQ XD NO: 16 x 

GCGTTTTGCC GGCCGACATA CATTCGCTTT GCCCCACCGG GTCCGTCTGT 
15 TATTAATGCC GC31113324 1122243113 3414324234 3322333224 

23 31GCCGAC AATGTCATGG TG 122 

(2) INFORMATION FOR SEQ ID NO: 17; 
(i) SEQUENCE CEARACTERXSTICS : 
20 (A) LENGTH: 7 8 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
25 (ix) FEATURE: 

(A) NAME/ KEY: misc - feature : 

(B) OTHER INFORMATION: /desc =: "RSERIII" 
(ix) FEATURE: 

(A) NAME/KEY: miec- feature 
30 (B) LOCATION: 19-60 

(D) : OTHER INFORMATION: /Ncte^ 1:97%A, 1%T, 1%C, 1%G 

2:97%T. 1%A, 1%C, 1%G 
3;97%C, 1%A, 1%T, 1%G 
4 * 97%G 1 A 1 ^T 1%C 
3 5 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: I?'; 

GTCGCCTTCC CTTGTCCA43 3413112423 1244244234 1112112312 
4324243233 GTACGCATAC TGTTTTCT 78 

40 (2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(E) TYPE: nucleic acid 
(C) STRANDEDNESS : single 

45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/ KEY: misc -feature : 

(B) OTHER INFORMATION: /desc = "FSERIII" 
5 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 18: 

TGGACAAGGG AAGGCGACAG 2 0 

(2) INFORMATION FOR SEQ ID NO : 19: 
Ci) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
^0 (ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INF0:RMATI0N; /desc = "RSBRV" 
(ix) FEATURE; 

CA) NAME/KEY: misc - feature 
65 (E) LOCATION: 19-^0 

(D) : OTHER INFORMATION: /Noce- 1;97%A, 1%T, 1%G, 1%G 

2:97%T; 1%A, 1%C, 1%G 
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4:97%G, 1%A, 1%T, 1%C 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 15: 

TAAGATCGGT TCAATTTT42 4222311443 1441122234 5432444142 
5 3233222342 CCCGTACATA TCCCCGTAGA A 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO : 20: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 18 base pairs 

(B) TYPE; nucleic acid 

(C) STRAKDEDNESS ! single 
CD] TOPOLOGY: linear 

(ii) MOLECULE TYPE; other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY; misc-f eature : 

(B) OTHER INFORMATION: /desc 
(xi) SEQUENCE DESCRIPTION;: SEQ ID NO- 20: 
AAAATTGAAC CGATCTTA 



"FSERV" 



16 



/desc = ^'FSERVII' 



(2) INFORMATION FOR SEQ ID NO : 21: 
(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 
25 (B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid 
( ix) FEATURE ; 

30 (A) NAME/ KEY: misc- feature : 

(B) OTHER INFORMATION: 
(ix) FEATURE; 

(A) NAME/KEY: inisc-f eature 
(E) LOCATION: 54-95 
35 (D) : OTHER INFORMATION: /Note= 1:97%A, 1%T, 1%C, 1%G 

2:97%T, 1%A, 1%C, 1%G 
3:^7%Cr 1%A, 1%T, 1%G 
4:97%G. 1%A, 1%T, 1%C 
{xi) SEQUSNCE DESCRIPTION: SEQ ID NO; 21: 
4 0 TTCCATGCTG CATCGACACA GGGAGGCGGC TATGATATGA GGAAATTGCT 
GAA3442134 4234222331 1431233422 4111234422 13122TGTCG 
ATAACCA 



lOS 



45 (2) INFORMATION FOR SEQ ID NO ; 22: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 16 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ixj FEATURE: 

(A) NAME/ KEY; misc- feature : 

(B) OTHER INFORMATION: /desc ^ "RSERVJI" 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TGTCGATGCA GCATGGAA 19 



(2) INFORMATION FOR SEQ ID NO: 23: 
60 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: SO base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

6S (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc - feature : 
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(B) OTHER INFORMATION: /desc = "FSERIX" 
( ix) FEATURE : 

(A) NAME/KEY: misc-f <iature 
(B) LOCATION: 21-62 
5 (D) : OTHER INFORMATION: /.tJote= 1:97%A, 1%T, 1%C, 1%G 

2;97%T, 1%A, 1%C, 1%G 
3:97%C, 1%A, 1%T, 1%G 
4:97%G. 1%A, 1%T, 1%C 
(Xi> SEQUENCE DESCRIPTION: SEQ ID NO; 23: 
10 GTCCAAACAT GGTTTAAGCC 4322432213 4322221223 2313114441 

1232441213 33TCAGGTTT TCTACGGGGA 8 0 

(2) INFORMATION FOR SEQ ID NO : 24: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 0 base pairs 

(B) TYPEt nucleic acid 

(C) STRANBEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
20 (ix) FEATURE; 

(A) NAME/ KEY: misc- f eat-Ure : 

(B) OTHER INFORMATION: /desc = "RSERIX" 
SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGCTTAAACC ATGTTTGGAC 2 0 

25 

(2) INFORMATION FOR SEQ ID NO : 26: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 

(E) TYPE: nucleic acid 
30 (G) STRANDEDNESS : Single 

(DJ TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc- feature : 
35 (B) OTHER INFORMATION: /desc = "Primer 13'' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CGATTGCTGA CGCTGTTATT TGCG 24 

(2) INFORMATION FOR SEQ ID NO : 27: 
40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE; 

(A) NAME/ KEY: mi sc - feature : 

(B) OTHER INFORI^TION: /desc = '^Primer #63" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 27: 
SO CTATCTTTGA ACATAAATTG AAACC 2 5 

(2) INFORMATION FOR SEQ ID NO : 28: 
(iJ SEQUENCE CHARACTERISTICS; 
55 (A) LENGTH: 2 0 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
eo (ix) FEATURE: 

(A) NAME/ KEY: mi BC- feature : 

(B) OTHER INF0:RMATI0N: /desc ^ »' forward Primerl*^ 
(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

gacctgcagt caggcaacta 20 



65 



(2) INFORMATION FOR SEQ ID NO: 29 j 
(i) SEQUENCE CHARACTERISTICS; 
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(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOX/OGy: linear 

S Cii) MOLECUI.E TYPE: other nucleic acid 
(ix) FEATURE 7 

(A) NAME/ KEY: misc - feature : 

(B) OTHER INFORMATION: /deec = "reverse primer 1" 
(xo.) SEQUENCE DESCRIPTION: SEQ ID NO; 29: ^ ^ 

10 tagagtcgac ctgcaggcat 20 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 
^5 (B) TYPE; nucleic acid 

(C) STRANDEDNESis : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: othej: nucleic acid 
(ix) FEATURE: 
20 (A) NAME/ KEY: miec-f eature : 

, OTHER INFORMATION: /desc = "forward primer 2" 

{XI) SEQUENCE DESCRIPTION; SEQ ID NO: 30* P^^imer 2 

gaccrgcagt caggcaacta 2 0 

25 C2) INFORMATION FOR SEQ ID NO ; 31: 
(i) SEQUENCE CHAE^CTERISTICS : 
CA) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; ochcr nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY; miac-f eature : 

CB) OTHER INFORMATION: /desc = "reverse urimer 2" 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 3 1 : ^^verse primer 2 

tagagtcgac ctgcaggcat 2 0 

[2) INFORMATION FOR SEQ ID NO ; 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) Organism: Bacillus amyloliquef aciens 
(IX) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 343. .1794 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GATGATTTGG 6 0 

CTGAAGAAGT GGATCGATTG TTTGAGAAAA GAAGAAGACC ATAAA?^TAC CTTGTCTGTC 12 0 

ATCAGACAGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 160 

GGGGGGTTGT TATTATTTTA CTGATATGTA AAATATAATT TGTATAAGAA AATGAGAGGG 24 0 

eo AGAGGAAACA TGATTCAAAA ACGAi^GCGG ACAGTTTCGT TCAGACTTGT GCTTATGTGC 300 

ACGCTGTTAT TTGTCAGTTT GCCGATTACA AAAACATCAG CC GTA AAT GGC ACG 354 

CTG ATG CAG TAT TTT GAA TGCJ TAT ACG CCG AAC GAC GGC CAG CAT TGG 402 

AAA CCA TTG CAG AAT GAT GCG GAA CAT TTA TCG GAT ATC GGA ATC ACT 45 0 
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GCC GTC TGG ATT CCT CCC GCA TAC AAA GGA TTG AGC CAA TCC GAT AAC 4 9$ 

GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA GGA GAA TTC CAG CAA AAA 54 6 

S GGG ACG GTC AGA ACG AAA TAC GGC AC A AAA TCA GAG CTT CAA GAT GCG 594 

ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA GTA TAC GGA GAT GTG GTT 642 

TTG AAT CAT AAG GCT GGT GC:T GAT GCA ACA GAA GAT GTA ACT GCC GTC ^9 0 

10 

GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA ACT TCG GAG CAA TAT CAA 73 8 

ATC AAA GCG TGG ACG GAT TT'T CGT TTT CCG GGC CGT GGA AAC ACG TAC 79 e 

15 AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC GAC GGA GCG GAC TGG GAT 834 

GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG TTT CGT GGG GAA GGA AAA 3 62 

GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC GGC AAC TAT GAC TAT TTA 93 0 

20 

ATG TAT GCT GAT GTT GAC TAC GA.C CAC CCT GAT GTC GTG GCA GAG ACA 97 8 

AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA CTG TCA TTA GAC GGC TTC 102 S 

2 5 CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT TCA TTT CTG CGT GAT TGG 1074 

GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA GAA ATG TTT ACG GTT GCG 1122 

GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC GAA AAC TAC TTG AAT AAA 1170 

30 

ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT CCG CTT CAT TTC AAT TTA 1218 

CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT GAT ATG AGG CGT TTG CTG 12^6 

35 GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA AAG GCG GTT ACA TTT GTT 1314 

GAA AAT CAT GAC ACA CAG CC<5 GGA CAG TCA TTG GAA TCG ACA GTC CAA 13 62 

ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT ATT TTG ACA AGA GAA TCC 1410 

40 

GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG TAC GGG ACA AAA GGG ACA 14 5 S 

TCG CCA AAG GAA ATT CCC TQA CTG AAA GAT AAT ATA GAG CCG ATT TTA 150 6 

4 5 AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC CAG CAC GAT TAT ATT GAC 1554 

CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA GGT GAC AGC TCC GCC GCC 1602 

AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC GGA CCC GGC GGA TCA AAG 16 SO 

50 

CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC GAG ACA TGG TAT GAC ATA 1698 

ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC GGA TCT GAC GGC TGG GGA 174 6 

55 GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC ATT TAT GTT CAG AAA TAA 17 94 

GGTAATAAAA AAACACCTCC AAGCTGAGTG CGGGTATCAG CTTGGAGGTG CGTTTATTTT 1854 

TTCAGCCGTA TGACAAGGTC GGCATCAGGT GTGACAAATA CGGTATGCTG GCTGTCATAG 1514 

€0 

GTGACAAATC CGGGTTTTGC GCCGTTTGGC TTTTTCACAT GTCTGATTTT TGTATAATCA 1374 

ACAGGCACGG AGCCGGAATC TTTCGCCTTG GAAAAATAAG CGGCGATCGT AGCTGCTTCC 2 034 

€5 AATATGGATT GTTCATCGGG ATCGCTGCTT TTAATCACAA CGTGGGATCC 2084 
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